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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Novartis AG 

(B) STREET: Schwarzwaldallee 215 

(C) CITY: Basel 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP): 4058 

(G) TELEPHONE: +41 61 324 1111 

(H) TELEFAX: + 41 61 322 75 32 

(ii) TITLE OF INVENTION: Rifamycin biosynthesis gene cluster 
(iii) NUMBER OF SEQUENCES: 9 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1,0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGTACCCGGT GTTCGCGACG GCGTTCGACG AGGCTTGCGA GCAGCTGGAC GTCTGTCTGG 
CCGGCCGTGC CGGGCACCGC GTGCGGGACG TCGTGCTCGG CGAAGTGCCC GCCGAAACCG 
GGCTGCTGAA CCAGACGGTC TTCACCCAAG CCGGGCTGTT CGCGGTGGAG AGCGCGCTGT 
TCCGGCTCGC CGAATCCTGG GGTGTCCGGC CGGACGTGGT GCTCGGCCAC TCCATCGGGG 
AGATCACCGC .CGCGTATCCC GCGGGCGTCT TCTCGCTGCC GGACGCCGCC CGGATCGTCG 
CGGCGCGCGG CCGGCTGATG CAGGCGCTGG CGCCGGGCGG GGCGATGGTC GCCGTCGCCG 
CCTCCaiLAGC CGAGGTGGCC GAACTGCTCG GCGACGGCGT GGAACTCGCC GCCGTCAACG 
GCCCTTCGGC GGTAGTCCTT TCCGGGGACG CGGACGCGGT CGTCGCGGCC GCCGCCCGCA 
OXSCGCGAGCG CGGGCACAAG ACCAAGCAGC TCAAGGTTTC GCACGCGTTC CACTCCGCGC 
GGATGGCGCC GATGCTGGCG GAGTTCGCCG CCGAGCTGGC CGGCGTGACG TGGCGCGAGC 
CGGAGATCCC GGTGGTCTCC AACGTGACCG GCCGGTTCGC CGAGCCCGGC GAACTGACCG 
AGCCGGGCTA CTGGGCCGAG CACGTGCGGC GGCCGGTGCG GTTCGCCGAG GGCGTCGCGG 
CCGCGACGGA GTCCGGCGGC TCGCTGTTCG TGGAGCTCGG GCCGGGGGCG GCGCTGACCG 
CCCTCGTCGA GGAGACGGCC GAGGTCACCT GCGTCGCGGC CCTGCGGGAC GACCGCCCGG 
AGGTCACCGC GCTGATCACC GCGGTCGCCG AGCTGTTCGT CCGCGGGGTT GCGGTCGATT 
GGCCGGCCCT GCTGCCGCCG GTCACCGGGT TCGTCGACCT GCCGAAGTAC GCCTTCGACC 
AGCAGCACTA TTGGCTGCAG CCCGCCGCGC AGGCCACGGA CGCGGCCTCG CTCGGGCAGG 
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TCGCGGCCGA CCACCCGCTG CTGGGCGCGG TC3GTCCGGCT GCCGCAGTCG GACGGCCTGG 
TCTTCACCTC GCGGCTGTCA TTGAAATCGC ACCCGTGGCT GGCCGACCAC GTCATCGGCG 
GGGTGGTGCT CGTCGCGGGC ACCGGGCTCG TCGAGCTGGC CGTCCGGGCC GGGGACGAGG 
CCGGCTGCCC GGTCCTCGAA GAACTCGTCA TCGAGGCTCC GCTGGTCGTC CCCGACCACG 
GCGGGGTCCG GATCCAGGTC GTCGTGGGGG CACCGGGGGA GACCGGTTCG CGCGCGGTCG 
AGGTGTACTC CCTGCGCGAG GACGCCGGTG CCGAAGTGTG GGCCCGGCAC GCCACCGGGT 
TCCTGGCTGC GACGCCGTCG CAGCACAAGC CGTTCGACTT CACCGCCTGG CCGCCGCCCG 
GCGTCGAGCG CGTCGACGTC GAGGACTTCT ACGACGGCTT CGTCGACCGC GGGTACGCCT 
ACGGGCCGTC GTTCCGGGGC CTGCGGGCGG TGTGGCGGCG CGGCGACGAA GTGTTCGCCG 
AGGTCGCCCT GGCCGAGGAC GACCGCGCGG ACGCGGCCCG GTTCGGCATC CACCCCGGCC 
TGCTGGACGC CGCCCTGCAC GCGGGCATGG CCGGTGCCAC CACCACGGAA GAGCCCGGCC 
GGCCGGTCCT GCCGTTCGCC TGGAACGGCC TGGTGCTGCA CGCGGCCGGG GCGTCCGCGC 
TCCGGGTCCG GCTCGCCCCG AGCGGTCCGG ACGCCCTGTC GGTCGAGGCC GGGGACGAGG 
CCGGCGGTCT CGTTGTGACG GCGGACTCGC TGGTCTCCCG GCCGGTGTCG GCCGAACAGC 
TGGGCGCGGC GGCGAACCAC GACGCGTTGT TCCGCGTGGA GTGGACCGAG ATTTCCTCGG 
CTGGAGACGT TCCGGCGGAC CACGTCGAAG TGCTCGAAGC CGTCGGCGAG GATCCCCTGG 
AACTGACCGG CCGGGTCCTG GAGGCCGTGC AGACCTGGCT CGCCGACGCA GCCGACGACG 
CTCGCCTGGT CGTGGTGACC CGCGGCGCCG TCCACGAGGT GACTGACCCG GCCGGTGCCG 
CGGTCTCGGG CCTGATCCGG GCCGCGCAGG CGGAAAACCC GGACCGGATC GTGCTGCTGG 
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ACACCGACGG TCAAGTGCCG CTAGGCCGGG TGCTGGCCAC CGGCGAGCCC CAAACAGCCG 
TCCGR.GGCGC CACGCTGTTC GCCCCGCGGC TGGCCCGCGC CGAGGCCGCG GAGGCACCGG 
CAGTGACCGG CGGGACGGTC CTGATCTCGG GCGCCGGCTC GCTGGGCGCG CTCACCGCCC 
GGCACCTGGT CGCCCGGCAC GGAGTCCGGC GGCTGGTGCT CGTCAGCCGC CGTGGCCCCG 
ACGCCGACGG CATGGCCGAA CTGACCGCTG AACTCATCGC TCAGGGCGCC GAGGTCGCCG 
TAGTCGCTTC CXSACCTGGCC GACCGGGACC AGGTCCGGGT ACTGCTGGCC GAGCACCGCC 
CGAACGCCGT CGTGCACACG GCCGGTGTTC TCGACGACGG CGTCTTCGAG TCGCTGACGC 
GGGAGCGGCT GGCCAAGGTC TTCGCGCCCA AAGTTACTGC TGCCAATCAC CTCGACGAGC 
IXaCCCGCGA ACTCGATCTT CGCGCGTTCG TCGTGTTCTC CTCCGCCTCC GGGGTCTTCG 
GCTCCGCCGG GCAGGGCAAC TACGCCGCTG CCAACGCCTA CCTGGACGCC GTGGTCGCCA 
ACCGCCGGGC CGCGGGCCTG CCCGGCACAT CGCTGGCCTG GGGCCTGTGG GAACAGACCG 
ACGGGATGAC CGCGCACCTC GGCGACGCCG ACCAGGCGCG GGCGAGTCGC GGCGGGGTCC 
TCGCCATCTC ACCCGCCGAA GGCATGGAGC TGTTCGACGC AGCGCCGGAC GGGCTCGTCG 
TCCCGGTCAA GCTGGACCTG CGCAAGACCC GCGCCGGCGG GACGGTGCCG CACCTGCTGC 
GCGGCCTGGT CCGCCCGGGA CGGCAGCAGG CCCGTCCGGC GTCCACTGTG GACAACGGAC 
TCGCCGGGCG ACTCGCCGGG CTCGCGCCGG CGGAGCAGGA GGCGCTGCTG CTCGACGTCG 
TCCGCACGCA GGTCGCGCTG GTGCTCGGGC ACGCCGGGCC GGAGGCCGTC CGCGCGGACA 
CGGCGTTCAA GGACACCGGC TTCGACTCGC TGACGTCGGT GGAACTGCGC AACCGGCTGC 
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GCGAGGCGAG CGGGCTCAAG CTQCCCGCGA CGCTCGTCTT CGACTACCCG ACGCCGGTCG 
CGC1GGCCCG CTftCCTCCGT GACGAATTCG GCGACACGGT GGCAACAACT CCGGTGGCCA 
CCGCGGCCGC AGCGGACGCC GGCGAGCCGA TCGCCATCGT CGGCATGGCX3 TGCCGGCTGC 
CGGGCGGGGT CACCGATCCC GAAGGCCTGT GGCGCCTGGT GCGCGACGGC CTCGAAGGQC 
TGTCTCCCTT CCTCGAGGAC CGGGGCTGGG ACCTGGAGAA CCTGTTCGAC GACGACCCCG 
ACCGCTCCGG CACGACGTAC ACCAGCCGGG GCGGGTTCCT CGACGGCGCC GGCCTGTTCG 
ACGCGGGCTT CTTCGGGATT TCGCCGCGCG AGGCGCTGGC CATGGftCCCG CAGCAGCGGC 
TGCTGCTCGA GGCGGCCO^ GAAGCCCTCG AAGGCACCGG TGTCGftCCCG GGCTCGTTGA 
AGGGCGCCGA CGTCGGGGTG TTCGCCGGGG TGTCCAACCA GGGCTATGGG ATGGGCGCGG 
ATCCGGCCGA ACTGGCGGGG TACGCGAGCA CGGCGGGCGC TTCGAGCGTC GTCTCGGGCC 
GRGTCTCGTA CGTCTTCGGG TTCGAAGGAC CGGCGGTCAC GATCGACACG GCTTGCTCGT 
CG1CGCTGGT GGCGATGCAC CTCGCCGGGC AGGCGCTGCG GCAGGGCGAG TGCTCGATGG 
CCCTGGCCGG TGGCGTCACG GTGATGGGGA CGCCCGGCAC GTTCGTGGAG TTCGCGAAGC 
AGCGCGGCCT GGCCGGCXSAC GGCCGGTGCA AGGCCTACGC CGAAGGCGCG GACGGCACGG 
GCTGGGCCGA GGGCGTCGGG GTCGTCGTGC TGGAGCGGCT GTCGGTGGCG CGCGAGCGCG 
GGCACCGGGT GCTGGCCGTG CTCCGCGGCA GCGCGGTCAA CTCCGACGGC GCGTCCAACG 
GCCTCACCGC CCCCAACGGG CCGTCGCAGC AACGGGTGAT CCGCCGGGCC CTGGCCGGCG 
CCGGCCTCGA ACCGTCCGAT GTGGACATCG TGGAAGGGCA CGGCACCGGG ACGGCGCTGG 
GCGACCCGAT CGAGGCGCAG GCCCTGCTGG CCACCTACGG CAAGGACCGC GACCCGGAGA 
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CGCCGTTCTG GCTGGGGTCG GTGAAGTCGA ACriCGGCCA CACGCAGTCC GCGGCCGGCG 
TGGCCGGGGT GATCAAGATG GTGCAGGCGC TGCGCCACGG CGTCATGCCG CCCACCCTGC 
ACGTGGACCG GCCCACCAGC CAGGTCGACT GGTCCGCGGG GGCCGTCGAA GTGCTGACCG 
AGGCACGGGA GTGGCCGCGG AACQGCCGTC CGCGCCGGGC CGGGGTGTCC TCGTTCGGGA 
TCAGCX3GCAC GAACGCCCAC CTGATCATCG AAGAAGCACC GGCCGAGCCA CAGCTTGCCG 
GACCACCGCC GGACGGCGGT GTGGTGCCGC TGGTCGTCTC GGCTCGCAGC CCCGGTGCCC 
TGGCCGGTCA GGCX3CGTCGG CTGGCCACGT TCCTCGGCGA CGGGCCCCTT TCCX3ACGTCG 
CCGGK3CGCT GACGAGCCGC GCCCTGTTCG GCGAGCGCGC GGTCGTCGTG GCGGATTCGG 
CCGAGGAAGC CCGCGCCGGT CTGGGCGCAC TGGCCCGCGG CGAAGACGCG CCGGGCCTGG 
TCCGCGGCCG GGTGCCCGCG TCCGGCCTGC CGGGCAAGCT CGTGTGGGTG TTCCCCGGGC 
AGGGGACGCA GTGGGTCGGC ATGGGCCGCG AACTCCTCGA AGAGTCTCCG GTGTTCGCCG 
AGCGGATCGC CGAGTGTGCG GC(X3CGCTGG AGCCGTGGAT CGGCTGGTCG CTGTTCGACG 
TCCTCCGTGG CGACGGTGAC CTCGATCGGG TCGATGTGCT GCAGCCCGCG TGCTTTGCGG 
TGATGGTCGG CTTGGCCGCG GTGTGGTCCT CGGCCGGGGT GGTCCCCGAT GCGGTGCTCG 
GCCACTCCCA GGGTGAGATC GCCGCGGCGT GCGTGTCGGG TGCGTTGTCG CTGGAGGATG 
CGGCGAAGGT GGTTGCCCTG CGCAGCCAGG CCATCGCCGC GAAGCTCTCC GGCCGCGGCG 
GGAOX^GCTTC GGTCGCCTTG GGCGAAGCCG ATGTGGTGTC GCGGCTGGCG GACGGGGTCG 
AGGTCGCTGC CGTCAACGGT CCGGCGTCCG TGGTGATCGC GGGGGATGCC CAGGCCCTCG 
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ACGAAACGCT GGAAGCGCTG TCCGGTCCGG GAATCCGGGC TCGGCC3C3GTG GCGGTGGACT 5520 

ACGCCTCGCA CACCCGGCAC GTCGAAGACA TCGAAGACAC CCTCGCCGAA GCX3CTGGCCG 5580 

GGATCGACGC CCGGGCGCCG CTCGOXXXGT TCCTCTCCAC CCTCACCX3GC GAGTGGATCC 5640 

GGGftCGAGGG CGTCGTGGAC GGCGGCTACT GGTACC 5676 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1891 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) i«DLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Tyr Pro Val Phe Ala Thr Ala Phe Asp Glu Ala Cys Glu Gin Leu Asp 
l' 5 10 15 

Val Cys Leu Ala Gly Arg Ala Gly His Arg Val Arg Asp Val Val Leu 
20 25 30 

Gly Glu Val Pro Ala Glu Thr Gly Leu Leu Asn Gin Thr Val Phe Thr 
35 40 45 

Gin Ala Gly Leu Phe Ala Val Glu Ser Ala Leu Phe Arg Leu Ala Glu 
50 55 60 

Ser Trp Gly Val Arg Pro Asp Val Val Leu Gly His Ser He Gly Glu 
65 70 75 80 
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Ile Thr Ala Ala Tyr Ala Ala Gly Val Phe Ser Leu Pro Asp Ala Ala 
85 90 95 

Arg He Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu Ala Pro Gly 
100 105 110 

Gly Ala Met Val Ala Val Ala Ala Ser Glu Ala Glu Val Ala Glu Leu 
115 120 125 

Leu Gly Asp Gly Val Glu Leu Ala Ala Val Asn Gly Pro Ser Ala Val 
130 135 140 

Val Leu Ser Gly Asp Ala Asp Ala Val Val Ala Ala Ala Ala Arg Met 
145 150 155 160 

Arg Glu Arg Gly His Lys Thr Lys Gin Leu Lys Val Ser His Ala Phe 
165 170 175 

His Ser Ala Arg Met Ala Pro Met Leu Ala Glu Phe Ala Ala Glu Leu 
180 185 190 

Ala Gly Val Thr Trp Arg Glu Pro Glu He Pro Val Val Ser Asn Val 
195 200 205 

Thr Gly Arg Phe Ala Glu Pro Gly Glu Leu Thr Glu Pro Gly Tyr Trp 
210 215 220 

Ala Glu His Val Arg Arg Pro Val Arg Phe Ala Glu Gly Val Ala Ala 
225 230 235 240 

Ala Thr Glu Ser Gly Gly Ser Leu Phe Val Glu Leu Gly Pro Gly Ala 
245 250 255 

Ala Leu Thr Ala Leu Val Glu Glu Thr Ala Glu Val Thr Cys Val Ala 
260 265 270 
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Ala Leu Arg Asp Asp Arg Pro Glu Val Thr Ala Leu lie Thr Ala Val 
275 280 285 

Ala Glu Leu Phe Val Arg Gly Val Ala Val Asp Trp Pro Ala Leu Leu 
290 295 300 

Pro Pro Val Thr Gly Phe Val Asp Leu Pro Lys Tyr Ala Phe Asp Gin 
305 310 315 320 

Gin His Tyr Trp Leu Gin Pro Ala Ala Gin Ala Thr Asp Ala Ala Ser 
325 330 335 

Leu Gly Gin Val Ala Ala Asp His Pro Leu Leu Gly Ala Val Val Arg 
340 345 350 

Leu Pro Gin Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Leu Lys 
355 360 365 

Ser His Pro Trp Leu Ala Asp His Val He Gly Gly Val Val Leu Val 
370 375 380 

Ala Gly Thr Gly Leu Val Glu Leu Ala Val Arg Ala Gly Asp Glu Ala 
385 390 395 400 

Gly Cys Pro Val Leu Glu Glu Leu Val He Glu Ala Pro Leu Val Val 
405 410 415 

Pro Asp His Gly Gly Val Arg He Gin Val Val Val Gly Ala Pro Gly 
420 425 430 

Glu Thr Gly Ser Arg Ala Val Glu Val Tyr Ser Leu Arg Glu Asp Ala 
435 440 445 



Gly Ala Glu Val Trp Ala Arg His Ala Thr Gly Phe Leu Ala Ala Thr 
450 455 460 



Pro Ser Gin His Lys Pro Phe Asp Phe Thr Ala Trp Pro Pro Pro Gly 
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465 



470 



475 



480 



Val Glu Arg Val Asp Val Glu Asp Phe Tyr Asp Gly Phe Val Asp Arg 
485 490 495 

Gly Tyr Ala Tyr Gly Pro Ser Phe Arg Gly Leu Arg Ala Val Trp Arg 
500 505 510 

Arg Gly Asp Glu Val Phe Ala Glu Val Ala Leu Ala Glu Asp Asp Arg 
515 520 525 

Ala Asp Ala Ala Arg Phe Gly He His Pro Gly Leu Leu Asp Ala Ala 
530 535 540 

Leu His Ala Gly Met Ala Gly Ala Thr Thr Thr Glu Glu Pro Gly Arg 
545 550 555 560 

Pro Val Leu Pro Phe Ala Trp Asn Gly Leu Val Leu His Ala Ala Gly 
565 570 575 

Ala Ser Ala Leu Arg Val Arg Leu Ala Pro Ser Gly Pro Asp Ala Leu 
580 585 590 

Ser Val Glu Ala Ala Asp Glu Ala Gly Gly Leu Val Val Thr Ala Asp 
595 600 605 



Ser Leu Val Ser Arg Pro Val Ser Ala Glu Gin Leu Gly Ala Ala Ala 
610 615 620 

Asn His Asp Ala Leu Phe Arg Val Glu Trp Thr Glu He Ser Ser Ala 
625 630 635 640 

Gly Asp Val Pro Ala Asp His Val Glu Val Leu Glu Ala Val Gly Glu 
645 650 655 



Asp Pro Leu Glu Leu Thr Gly Arg Val Leu Glu Ala Val Gin Thr Trp 
660 665 670 
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Leu Ala Asp Ala Ala Asp Asp Ala Arg Leu Val Val Val Thr Arg Gly 
675 680 685 

Ala Val His Glu Val Thr Asp Pro Ala Gly Ala Ala Val Trp Gly Leu 
690 695 700 

He Arg Ala Ala Gin Ala Glu Asn Pro Asp Arg He Val Leu Leu Asp 
705 710 715 720 

Thr Asp Gly Glu Val Pro Leu Gly Arg Val Leu Ala Thr Gly Glu Pro 
725 730 735 

Gin Thr Ala Val Arg Gly Ala Thr Leu Phe Ala Pro Arg Leu Ala Arg 
740 745 750 

Ala Glu Ala Ala Glu Ala Pro Ala Val Thr Gly Gly Thr Val Leu He 
755 760 765 

Ser Gly Ala Gly Ser Leu Gly Ala Leu Thr Ala Arg His Leu Val Ala 
770 775 780 

Arg His Gly Val Arg Arg Leu Val Leu Val Ser Arg Arg Gly Pro Asp 
785 790 795 .800 

Ala Asp Gly Met Ala Glu Leu Thr Ala Glu Leu He Ala Gin Gly Ala 
805 . 810 815 

Glu Val Ala Val Val Ala Cys Asp Leu Ala Asp Arg Asp Gin Val Arg 
820 825 830 

Val Leu Leu Ala Glu His Arg Pro Asn Ala Val Val His Thr Ala Gly 
835 840 845 



Val Leu Asp Asp Gly Val Phe Glu Ser Leu Thr Arg Glu Arg Leu Ala 
850 855 860 
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Lys Val Phe Ala Pro Lys Val Thr Ala Ala Asn His Leu Asp Glu Leu 
865 870 875 880 

Thr Arg Glu Leu Asp Leu Arg Ala Phe Val Val Phe Ser Ser Ala Ser 
885 890 895 

Gly Val Phe Gly Ser Ala Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala 
900 905 910 

Tyr Leu Asp Ala Val Val Ala Asn Arg Arg Ala Ala Gly Leu Pro Gly 
915 920 925 

Thr Ser Leu Ala Trp Gly Leu Trp Glu Gin Thr Asp Gly Met Thr Ala 
930 935 940 

His Leu Gly Asp Ala Asp Gin Ala Arg Ala Ser Arg Gly Gly Val Leu 
945 950 955 960 

Ala lie Ser Pro Ala Glu Gly Met Glu Leu Phe Asp Ala Ala Pro Asp 
965 970 975 

Gly Leu Val Val Pro Val Lys Leu Asp Leu Arg Lys Thr Arg Ala Gly 
980 985 990 

Gly Thr Val Pro His Leu Leu Arg Gly Leu Val Arg Pro Gly Arg Gin 
995 1000 1005 

Gin Ala Arg Pro Ala Ser Thr Val Asp Asn Gly Leu Ala Gly Arg Leu 
1010 1015 1020 

Ala Gly Leu Ala Pro Ala Glu Gin Glu Ala Leu Leu Leu Asp Val Val 
1025 1030 1035 1040 



Arg Thr Gin Val Ala Leu Val Leu Gly His Ala Gly Pro Glu Ala Val 
1045 1050 1055 



Arg Ala Asp Thr Ala Phe Lys Asp Thr Gly Phe Asp Ser Leu Thr Ser 
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1060 



1065 



1070 



Val Glu Leu Arg Asn Arg Leu Arg Glu Ala Ser Gly Leu Lys Leu Pro 
1075 1080 1085 

Ala Thr Leu Val Phe Asp Tyr Pro Thr Pro Val Ala Leu Ala Arg Tyr 
1090 1095 1100 

Leu Arg Asp Glu Phe Gly Asp Thr Val Ala Thr Thr Pro Val Ala Thr 
1105 1110 1115 1120 

Ala AJLa Ala Ala Asp Ala Gly Glu Pro He Ala He Val Gly Met Ala 
1125 1130 1135 

Cys Arg Leu Pro Gly Gly Val Thr Asp Pro Glu Gly Leu Trp Arg Leu 
1140 1145 1150 

Val Arg Asp Gly Leu Glu Gly Leu Ser Pro Phe Pro Glu Asp Arg Gly 
1155 1160 1165 

Trp Asp Leu Glu Asn Leu Phe Asp Asp Asp Pro Asp Arg Ser Gly Thj: 
1170 1175 . 1180 

Thr Tyr Thr Ser Arg Gly Gly Phe Leu Asp Gly Ala Gly Leu Phe Asp 
1185 1190 1195 1200 

Ala Gly Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro 
1205 1210 1215 

Gin Gin Arg Leu Leu Leu Glu Ala Ala Trp Glu Ala Leu Glu Gly Thr 
1220 1225 1230 



Gly Val Asp Pro Gly Ser Leu Lys Gly Ala Asp Val Gly Val Phe Ala 
1235 1240 1245 



Gly Val Ser Asn Gin Gly Tyr Gly Met Gly Ala Asp Pro Ala Glu Leu 
1250 1255 1260 
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Ala Gly Tyr Ala Ser Thr Ala Gly Ala Ser Ser Val Val Ser Gly Arg 
1265 1270 1275 1280 

Val Ser Tyr Val Phe Gly Phe Glu Gly Pro Ala Val Thr He Asp Thr 
1285 1290 . 1295 

Ala Cys Ser Ser Ser Leu Val Aia Met His Leu Ala Gly Gin Ala Leu 
1300 1305 1310 

Arg Gin Gly Glu Cys Ser Met Ala Leu Ala Gly Gly Val Thr Val Met 
1315 1320 1325 

Gly Thr Pro Gly Thr Phe Val Glu Phe Ala Lys Gin Arg Gly Leu Ala 
1330 1335 1340 

Gly Asp Gly Arg Cys Lys Ala Tyr Ala Glu Gly Ala Asp Gly Thr Gly 
1345 1350 1355 1360 

Trp Ala Glu Gly Val Gly Val Val Val Leu Glu Arg Leu Ser Val Ala 
1365 1370 1375 

Arg Glu Arg Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val 
1380 1385 1390 

Asn Ser Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser 
1395 1400 1405 

Gin Gin Arg Val He Arg Arg Ala Leu Ala Gly Ala Gly Leu Glu Pro 
1410 1415 1420 

Ser Asp Val Asp He Val Glu Gly His Gly Thr Gly Thr Ala Leu Gly 
1425 1430 1435 1440 



Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Lys Asp Arg 
1445 1450 1455 
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Asp Pro Glu Thr Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Phe Gly 
1460 1465 1470 

His Thr Gin Ser Ala Ala Gly Val Ala Gly Val He Lys Met Val Gin 
1475 1480 1485 

Ala Leu Arg His Gly Val Met Pro Pro Thr Leu His Val Asp Arg Pro 
1490 1495 1500 

Thr Ser Gin Val Asp Trp Ser Ala Gly Ala Val Glu Val Leu Thr Glu 
1505 1510. 1515 1520 

Ala Arg Glu Trp Pro Arg Asn Gly Arg Pro Arg Arg Ala Gly Val Ser 
1525 1530 1535 

Ser Phe Gly He Ser Gly Thr Asn Ala His Leu He He Glu Glu Ala 
1540 1545 1550 

Pro Ala Glu Pro Gin Leu Ala Gly Pro Pro Pro Asp Gly Gly Val Val 
1555 1560 1565 

Pro Leu Val Val Ser Ala Arg Ser Pro Gly Ala Leu Ala Gly Gin Ala 
1570 1575 1580 

Arg Arg Leu Ala Thr Phe Leu Gly Asp Gly Pro Leu Ser Asp Val Ala 
1585 1590 1595 1600 

Gly Ala Leu Thr Ser Arg Ala Leu Phe Gly Glu Arg Ala Val Val Val 
1605 1610 1615 

Ala Asp Ser Ala Glu Glu Ala Arg Ala Gly Leu Gly Ala Leu Ala Arg 
1620 1625 1630 

Gly Glu Asp Ala Pro Gly Leu Val Arg Gly Arg Val Pro Ala Ser Gly 
1635 1640 1645 

Leu Pro Gly Lys lieu Val Trp Val Phe Pro Gly Gin Gly Thr Gin Trp 
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1650 1655 1660 

Val Gly Met Gly Arg Glu Leu Leu Glu Glu Ser Pro Val Phe Ala Glu 
1665 1670 1675 1680 

Arg He Ala Glu Cys Ala Ala Ala Leu Glu Pro Trp He Gly Trp Ser 
1685 1690 1695 

Leu Phe Asp Val Leu Arg Gly Asp Gly Asp Leu Asp Arg Val Asp Val 
1700 1705 1710 

Leu Gin Pro Ala Cys Phe Ala Val Met Val Gly Leu Ala Ala Val Trp 
1715 1720 1725 

Ser Ser Ala Gly Val Val Pro Asp Ala Val Leu Gly His Ser Gin Gly 
1730 1735 1740 

Glu He Ala Ala Ala Cys Val Ser Gly Ala Leu Ser Leu Glu Asp Ala 
1745 1750 1755 1760 

Ala Lys Val Val Ala Leu Arg Ser Gin Ala He Ala Ala Lys Leu Ser 
1765 1770 1775 

Gly Arg Gly Gly Met Ala Ser Val Ala Leu Gly Glu Ala Asp Val Val 
1780 17S5 1790 

Ser Arg Leu Ala Asp Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala 
1795 1800 1805 

Ser Val Val He Ala Gly Asp Ala Gin Ala Leu Asp Glu Thr Leu Glu 
1810 1815 1820 

Ala Leu Ser Gly Ala Gly He Arg Ala Arg Arg Val Ala Val Asp Tyr 
1325 1830 1835 1840 



Ala Ser His Thr Arg His Val Glu Asp He Glu A^p Thr Leu Ala Glu 
1845 1850 1855 
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Ala Leu Ala Gly lie Asp Ala Arg Ala Pro Leu Val Pro Phe Leu Ser 
1860 1855 1870 

Thr Leu Thr Gly Glu Trp He Arg Asp Glu Gly Val Val Asp Gly Gly 
1875 1880 1885 

Tyr Trp Tyr 
1890 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doioble 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAATTCCAGG CCGTCGACGG CTGCGACATC GCGGTCTTCC GGTGGTCGCA CCGCACGAAG 
ATCGCCGAAT AAGAATTTCC Ga^^TCTCCCA CGGGAAAGGT TTCCATGACC GACGCAATAT 
CCTTCGAGGT GCCGTGGGAC CGGACCGACA AGTTCGACCC GCCCGCGGTG TTCGACTCTC 
TGCGCGAAGA ACGTCCGCTC GCGAAGATGG TTTACCCGGA TGGGCACGTC GGCTGGATCG 
TTTCCAGCTA CGAGCTGGTC CGCGAGGTCC TCAGCGACCT GCGGTTCAGC CACAGCTGCG 
AAGTCGGCCA CTTCCCGGTG ACCCACCAGG GCCAGGTCAT CCCGACCCAC CCGCTGATCC 
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CCGGCATGTT CATCCACATC GACCCGCCCG AGCACACGCG CTACCGCAAG CTGCTGACCG 420 

GCGAGTTCAC CGTCCGCCGC GCXIAGCAGGC TGATCCCGCG GGCCGAGGCC GTGGCCGCCG 480 

AGCAGATCGA GGTCATGCGG GCCAAGGGCG CCCCCGCGGA CGTGGTCATG GACTTCGCCA 540 

AGCCGCTGGT GCTGCGGATC CTGGGCGAGC TCGTCGGCCT GCCCTACGAG GAACGCGACC 600 

GGTACGTCCC CGCGGTGftCC CTCCTGCACG ACGCCGAAGC GGACCCGGCC GAGGCCGCGG 660 

CCGCCTACGA GGTGGCCGGG AAGTTCTTCG ACGAGGTCAT CGAGCGCCGC CGGCAGCGGC 720 

Li. 

n CCCAGGACGA CCTCATCAGC TCGCTCGTCA CCGAGGACCT GACCCAGGAG GAGCTGCGCA 780 

5. 

ACATCGTCAC CCTCCTCCTG TTCGCCGGGT ACGAGACCAC CGAGGGCGCG CTCGCCACCG 840 

GCGTCTTCGC GCTGCTGCAC CACACCGATC AGCTGGCGGC ACTGCGCGCG GAGCCGGAAA 900 

C3 AGCTCGACGC CGCGATCGAA GAGCTGCTGC GCTACCTGAC CGTCAACCAG TACCACACCT 960 

S ACCGCACCGC GCTGGAGGAC GTGAAGCTGG AGGGCGAGCT GATCAAGAAG GGCGACACGG 1020 

zsse 

n I 

TCACGGTGTC GCTGCCCGCG GCCAACCGCG ACCCGGCCAA GTTCGGCTGT CCCGCGGAGC 1080 

TCGACATCGA GCGGGACACC TCCGGCCACG TCGCGTTCGG CTTCGGCATC CACCAGTGCC 1140 

TCGGCCAGAA CCTGGCGCGC ATCGAGCTGC GGGCCGGCTT CACGGCGCTC CTGCGGGCGT 1200 

TCCCCGAGCT CCGGCTGGCC GTCCCGGCCG ACGAGGTTCC GCTGCGGCTG AAGGGTTCCG 1260 

TCTTCTCGGT GAAGAAGCTG CCCGTCTCCT GGTGAGCGTT CTTCCCCTCG AACACCCGAA 1320 

AGGATCTGCG GCACAGTGCG CACCGATCTC ATCAAGCCAC TTCACGTCGC ACTCCTGGAG 1380 

AiVCGCGACCC GCTTCGCCGG CAAGCCGGCC TTCGCCGACG ACCACCGGAC GGTCACCTAC 1440 

GGCGACCTCG AGGCGCGGAC GCGCCGGCTG GCCGGGCACC TGGCCGGCCT CGGTGTCCGG 1500 
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CACGGCGACC GGGTGGCGAT CTGCCTCQGC AACCGGGTGT CCACTGTGGA GAGTTACTTC 
GCGATCCTCC GCGCGQGTGC CGTCGGCGTG CCGCTCAACC CCGGTTCGGC GACGGCCGAG 
CTCGAGCACC CGCTCACCGA CAGCGGCGCC ACGGTGGTCG TCACCGACGC CGCCCAGGCG 
GCCCGGCTCC GGCTCGCGCC GCACGTCGAG CTGCTGGTGA CCGGCGACGA CGTCCCGGAG 
GGCGCCCACT CCTACGACGA ACTCGCCCTC AGCGAACCGG CCGAGCCCGC CGCGGACGAC 
COXXSAGCTCG ACGAGCCGGC GTGGATGTTC TACACGTCGG GCACGACCGG GCGGCCCAAG 
GGCGTCGTGT CCACQCAGCG CAACTGCCTC TGGTCCGTCG CTTCCTGCTA CGTGCCGTTC 
CCCGGGTTGT CGGACXAGGA CCGGGTGCTC TGGCCGCTCC CGCTGTTCCA CAGCCTTTCG 
CACATCGCCT GCGTCCTGTC CGCCACCGTG GTCGGGGCCA GCGTCCGGAT CGCCGACGGC 
AGCTCCGCCG ACGACGTCAT GCGGCTGATC GAGGCGGAGA GCTCGACCTT CCTGGCCGGC 
GTGCCGACCA CCTACCACCA CCOXSGTGCGG GCCGCCCGGC AGCGCGGTTT CTCCGCGCCG 
AGCCTGCGGA TCGGCCTGGC CGGGGGCGCG GTCCTCGGCG CCGGGCTGCG AAGCGAGTTC 
GAAGAGACCT TCGGGGTCCC GCTGATCGAC GCCTACGGCA GCACCGAGAC CTGCGGGGCG 
ATCACCATGA ACCCGCCGGA CGGCGCCCGC GTCGAGGGCT CGTGCGGCTT GGCCGTGCCG 
GGCGTCGACG TGCGGGTCGT CGACCCCGAC ACCGGGCTCG ACGTCCCCGC CGGCGAGGAG 
GGCGAGGTCT GGGTCAGCGG GCCGAACGTC ATGCTCGGCT ACCACAACAG CCCGGAGGCG 
ACCGCCGCGG CGATGCGGGA CGGCTGGTTC CGGACCGGGG ACCTGGCCCG CCGCGACGAC 
GCCGGTTACT TCACCATCTG CGGCCGGATC AAGGAACTCA TCATCCGCGG CGGCGCGAAC 
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ATCCACCCCG GCGAGGTCGA GGCGGTCCTG CGCACGGTCG ACGGCGTCGC GGACGCGGCG 
GTCGGCGGTC TCCCGCACGA CACGCTCGGC GAGGTGCCGG TCGCCTACGT CATCCCCGGA 
CCGACCGGTT TCGATCCTGC GGCGTTGATC GAGAAGTGCC GCGAACAGCT GTCCGCCTAC 
AAGGTGCCGG ACCGGATCCT CGAGGTCGCC CACATTCCCC GGACCGCGTC GGGCAAGATC 
CGGCGCGGGC TGCTGACCGA CGAGCCCGCG CAGCTGCGGT ACGCCGCGAC CGAACACGAG 
GAACAGTCCC ggcacgcx:ga CGAGTCCGTC GCGGCGGCGC TGCGCGCGCG ACTGTCCGGT 

ttggacgaac gcgcccagtg cgagctcctg gaagacctcg tccgcaccca ggcggccisac 
gtgctggggc agccggtccc ggacgggcgt gcgttccgcg acctcggctt cacgtcgctg 
gccatcgtgg agctgcgcaa ccggctgacc gagcacaccg ggctctggct gcccgccagc 
gccgtcttcg accaccccac gccggcggcg ctggccgccc gcgtccgggc tgagctcctc 

GGGATCACGC AGGCCGTCGC GGAGCCGGTC GTCGCGGCCG ACCCGGGCGA GCCGATCGCG 
ATCGTGGGGA TGGCCTGCCG CCTGCCGGGT GGCGTGGCGT CCCCGGAAGA CCTGTGGCGG 
CTCGTGGCCG AGCGCGTCGA CGCCGTTTCG GAGTTCCCCG GCGACCGCGG CTGGGACCTG 
GACAGCCTGA TCGACCCGGA CCGGGAGCGC GCCGGGACGT CGTACGTCGG CCAGGGCGGA 
TTCCTGCACG ACGCCGGCGA GTTCGACGCC GGGTTCTTCG GGATCTCGCC GCGTGAGGCC 

gtcgcgatgg acccgcagca gcggttgctg ctggagacgt cgtgggaggc cctcgaaaac 
gccggagtcg acccgatcgc gttgaagggc acca^caccg gcgtgttctc cggcctcatg 
ggccaggggt acgggtccgg cgcggtggcg ccggagctcg aaggtttcgt caccaccggg 

GTCGCGTCGA GCGTGGCCTC GGGCCGGGTG TCGTACGTGC TGGGACTGGA AGGCCCGGCG 
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GTCACCGTGG ACACCGCGTG TTCGTCGTCG CTGGTCGCGA TGCACCTGGC CGCGCAGGCC 
CTCCGGCAGG GCGAATGCTC GATGGCGCTC GCCGGCGGGG TCACGGTGAT GGCCACGCCG 
GGCTCGTTCG TCGAGTTCTC CCGCCAGCGG GCCCTGGCGC CCGACGGGCG CTGCAAGGCC 
TTCGCGGCGG CGGCCGACGG GACCGGCTGG TCCGAGGGTG TCGGCGTGGT CGTCCTCGAG 
CGGCTGTCCG TGGCGCGCGA GCGGGGCCAC CGGATCCTGG CCGTTTTGCG TGGCAGCGCG 
GTCAACCAGG ACGGCGCGTC CAACGGGCTC ACCGCGCCGA ACGGCCTCTC GCAGCAGCGG 
GTCATCCGCC GCX3CGCTCGC CGCGGCCGGG CTGGCACCGT CCGATGTGGA CGTCGTCGAG 
GCGCACGGCA CCGGGACCAC GCTGGGTGAC CCGATCGAGG CGCJ^GGCCCT GCTGGCGACC 
TACGGCCAGG AGCGGAAGCA GCCGTTGTGG CTCGGTTCGC TCA^GTCGAA CATCGGCCAC 
GCGCAGGCGG CCGCGGGCGT TGCGGGCGTC ATCAAGATGG TGCAGGCGCT GCGGCACGAG 
ACCTTGCCGC CGACGCTGCA TGTCGACAAG CCGACTCTTG AGGTGGACTG GTCCGCCGGT 
GCCATTGAAC TGCTGACGGA GGCCCGTGCG TGGCCGCGCA ACGGCCGTCC GCGCCGGGCC 
GGGGTCTCGT CGTTCGGCGT CAGCGGGACC A1«:GCGCACC TGATCCTGGA GGAGGCGCCG 
GCCGAGGAGC CGGTCGCTGC CCCGGAACTG CCGGTGGTGC CCCTGGTGGT GTCGGCGCGG 
AGCACGGAGT CGCTGTCCGG GCAGGCCGAG CGGCTGGCGT CCCTCCTCGA AGGGGACGTC 
TCGCTGACCG AGGTGGCCGG GGCGCTGGTG TCCCGCCGGG CGGTGCTGGA CGAGCGGGCC 
GTCGTCGTGG CCGGTTCGCG CGAGGAAGCC GTGACCGGGC TGCGGGCGCT GAACACGGCC 
GGTTCGGGGA CGCCGGGCAA GGTCGTGTGG GTGTTCCCGG GGCAGGGGAC GCAGTGGGCC 
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GGGATGGGCC GTGAGCTCCT GGCCGAGTCC CCGGTGTTCG CCGAGCGGAT CGCCGAGTGC 4860 

GCGGCCGCGT TGGCGCCGTG GATCGACTGG TCGCTCGTCG ACGTCCTGCG CGGCGAGGGC 4920 

GACCTGGGTC GGGTCGATGT GCTGCAGCCG GCCTGTTTCG CGGTGATGGT CGGGCTGGCT 4980 

GCCGTCTGGG AGTCCGTGGG GGTCCGGCCG GACGCCGTCG TCGGGCACTC GCAGGGTGAG 5040 

ATCGCGGCTG CCTGCGTTTC GGGGGCGTTG TCCCTCGAGG ACGCGGCGAA GGTGGTGGCC 5100 

CTCCGCAGCC AGGCCATCGC GGCGGAACTG TCCGGCCGCG GCGGGATGGC GTCGGTCGCC 5160 

b CTGGGCGAGG ACGACGTCGT TTCGCGGCTG GTGGACGGGG TCGAGGTCGC CGCCGTCAAC 5220 

f'ni ■ 

m GGCCCGTCGT CGGTGGTGAT CGCCGGGGAT GCCCATGCCC TCGACGCGAC CCTGGAAATC 5280 

m' TTGTCCGGGG AAGGCATCCG GGTTCGGCGG GTGGCGGTGG ACTACGCCTC GCACACCCGG 5340 

p. CATGTCGAGG ACATCCGCai^ CACTCTTGCC GAAACCTTGG CCGGGATCAG TGCGCAGGCG 5400 

^3 CCGGCTGTGC CGTTCTACTC CACCGTCACG AGCGAGTGGG TGCGCGACGC GGGGGTGCTG 5460 

GACGGCGGCT ACTGGTACCG GAACCTGCGC AACCAGGTCC GGTTCGGAGC GGCCGCGACG 5520 

GCCCTGCTCG AGCAGGGCCA CACGGTGTTC GTCGAGGTCA GTGCGCACCC GGTGACGGTC 5580 

CAGCCCTTGA GCGAGCTCAC CGGGGACGCG ATCGGGACAT TGCGGCGTGA AGACGGTGGC 5640 

CTGCGGCGGT TGCTGGCTTC GATGGGTGAG CTGTTCGTCC GCGGCATCGA CGTGa^VCTGG 5700 

ACGGCGATGG TGCCCGCGGC CGGCTGGGTC GACTTGCCGA CCTACGCGTT CGAACACCGG 5760 

CACTACTGGC TCGAGCCCGC CGAGCCCGCT TCGGCCGGAG ACCCGCTGCT GGGCACAGTC 5820 

GTCAGCACTC CCGGTTCGa^V CCGACTCACC GCCGTGGCGC AGTGGTCGCG CCGGGCGCAG 5880 

CCCTGGGCGG TGGACGGCCT GGTGCCGAAC GCGGCCCTGG TCGAGGCGGC CATCCGGCTC 5940 



GGCGACCTGG CCGGCACCCC CGTCGTCGGC GAACTGGTCG TCGACGCGCC GGTGGTGCTG 
CCGCGGCGCG GCAGCCGCGA GGTCCAGCTG ATCGTCGGCG AGCCCGGCGA GCAGCGGCGG 
CGTCCGATCG AGGTCTTTTC CCGGGAAGCC GACGAGCCGT GGACGCGGCA CGCGCACGGC 
ACACTCGCTC CCGCCGCCGC TGCGGTGCCA GAACCGGCGG CGGCGGGAGA CGCO^CCGAC 
GTCACCGTGG CCGGCCTGCG CGACGCGGAC CGGTACGGGA TCCACCCCGC GCTGCTGGAC 
GCCGCCGTCC GCACGGTCGT CGGCGACGAC CTGCTCCCGT CGGTGTGGAC CGGCGTGTCC 
CTGCTGGCCT CCGGGGCCAC GGCCGTGACC GTGACGCCGA CGGCGACCGG CCTGCGGCTG 
ACCGACCCGG CCGGGCAGCC CGTCCTGACC GTCGAATCCG TGCGCGGCAC GCCGTTCGTC 
GCCGAGCAGG GGACCACCGA CGCGCTCTTC CGCGTCGACT GGCCGGAAAT CCCGCTGCCC 
ACCGCCGAAA CCGCGGACTT CCTGCCGTAC GAAGCCACGT CGGCCGAGGC GACCCTCTCC 
GCGCTCCAGG CCTGGCTGGC AGACCCCGCG GAAACCCGGC TGGCCGTGGT CACCGGGGAC 
1GCACCGAAC CCGGCGCGGC CGCGATCTGG GGCCTGGTGC GCTCGGCGCA GTCCGAACAC 
CCCGGCCGGA TCGTGCTGGC CGACCTCGAC GACCCCGCCG TGCTGCCCGC CGTGGTGGCG 
AGCGGCGAAC CGCAGGTGCG GGTGCGCAAC GGCGTCGCCT CGGTGCCGCG CTTGACCCGG 
GTTACTCCCC GGCAGGACGC GCGGCCGCTC GACCCCGAGG GCACCGTCCT GATCACCGGC 
GGCACCGGCA CGCTCGGTGC GCTGACCGCC CGGCACCTCG TCACCGCGCA CGGCGTCCGG 
CACCTGGTGC TGGTCAGCCG CCGCGGTGAG GCTCCCGAGC TGCAGGAAGA ACTGACCGCA 
CTGGGGGCAT CCGTCGCCAT CGCCGCCTGC GACGTGGCAG ACCGGGCGCA GCTCGAAGCC 
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GTCTTGCGCG CGATCCCGGC CGAGCACCCG CTCACCGCCG TGATCCACAC CGCGGGGGTC 7080 

CTCGACGACG GCGTCGTCAC CGAGCTGACC CCGGACCGGC TCGCCACCGT GCGGCGGCCG 7140 

AAGGTCGACG CCGCCCGGCT CCTGGACGAG CTCACCCGGG AGGCCGATCT CGCCX3CGTTC 7200 

GTGCTGTTCT CCTCGGCGGC GGGTGTGCTG GGCAACCX:CG GCCAGGCCGG GTACGCCGCC 7260 

GCCAACGCCG AGCTCGATGC GTTGGCGCGC CAGCGGAACA GCCTCGACCT GCCCGCGGTG 7320 

TCCATCGCAT GGGGCTACTG GGCGACGGTC AGCGGGATGA CTGAGCACCT GGGCGACGCC 7380 

S GACCTCCGGC GCAACCAGCG GATCGGCATG TCCGGGCTTC CCGCCGACGA GGGCATGGCG 7440 

«=? 

CTGCTGGACG CCGCCATCGC CACCGGTGGC ACGCTGGTCG CGGCCAAGTT CGACGTCGCC 7500 

^- GCGCTCCGGG CGACGGCGAA GGCCGGCGGC CCGGTGCCGC CGCTGCTGCG TGGCCTGGCC 7560 

Ifl 

S3 CCGCTGCCGC GCCGGGCGGC GGCCAAGAOC GCGTCGCTGA CCGAACGCCT CGCCGGGCTG 7620 

?3 ■ 

GCCGAGRCCG AGCAGGCCGC GGCCCTGCTC GACCTGGTCC GGCGGCACGC CGCCGAGGTG 7680 

fii' 

CTCGGGCACA GCGGCGCCGA ATCCGTCCAT TCAGGACGGA CGTTCAAGGA CGCCGGCTTC 7740 

GRCTCGCTGA CCGCGGTGGA ACTGCGGAAC CGCCTCGCGG CCGCGACCGG GCTCACCCTG 7800 

TCCX:CQGCGA TGATCTTCGA CTACCCGAAG CCCCCGGCGC TCGCGGACCA CCTGCGCGCC 7860 

ARGCTCTTCG GATCGGCGGC GAACCGGCCG GCCGAGATCG GCACCGCCGC GGCCGAGGAG 7920 

CCGATCGCGA TCGTCGCGAT GGCGTGCCGC TTCCCCGGTG GCGTGCACAG CCCCGAGGAC 7980 

CTGTCGCGGC TCGTCGCCGA CGGCGCCG.AC GCCGTCACCG AGTTCCCCGC CGACCGCGGC 8040 

TGGGACACCG ACCGGCTCTA CCACGAAGAC CCCGACCACG AAGGCACGAC GTACGTCCGG 8100 

CACGGCGCCT TCCTCGACGA CGCCGCCGGG TTCGACGCCG CCTTCTTCGG CATCTCGCCG 8160 
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AACGAGGCGC TCGCCATGGA CCCGCAGCAG CGGCTGCTGC TGGAGACGTC CTGGGAGCTG 8220 

TTCGAGCGGG CCGCGATCGA CCCGACCACG CTGGCCGGCC AGGACATCGG CGTCTTCGCC 8280 

GGCGTCAACA GCCACGACTA CAGCATGCGG ATGCACCGGG CCGCCGGTGT CGAGGGCTTC 8340 

CGGCTCACCG GCGGTTCGGC CAGCGTGCTC TCCGGCCGCG TCGCCTACCA CTTCGGCGTC 8400 

GAAGGCCCX^G CCGTCACGGT CGACACGGCC TGCTCGTCTT CGCTGGTCGC GCTGCACATG 8460 

|,& GCGGTGCAGG CCCTCCAGCG CGGCGAGTGC TCCATGGCGC TCGCGGGCGG CGTGATGGTG 8520 

b 

]- ATGGGCACGG TCGAGACGTT CGTCGAGTTC TCGCGGCAGC GCGGGCTGGC CCCCGACGGC 8580 

m 

n^, CGCTGCAAGG CGTTCGCCGA CGGCGCGGAC GGCACCGGCT GGTCCGAGGG CGTCGGGCTG 8640 

yi 

s 



CTCCTGGTGG AGCGGCTGTC CGAGGCTCAG CGTCGCGGGC ACCAGGTCCT CGCCGTGGTC 8700 

CGCGGGTCGG CGGTCAACTC CGACGGCGCG TCGAACGGCT TGACGGCCCC GAACGGCCCG 8760 

f Z 

PJ TCCCAGCAGC GCGTGATCCG CAAGGCACTG GCCGCCGCCG GACTGTCCAC ATCGGACGTC 8820 

GACGCGGTGG AGGCGCACGG CACCGGGACG ACCCOXSGGCG ACCCGATCGA GGCCGAGGCG 8880 

CTGCTGGCCA CCTACGGCCA GAACCGGGAA ACGCCGCTGT GGCTCGGGTC GGTGAAGTCG 8940 

AACCTCGGGC ACACGCAGGC G3CTGCGGGT GTCGCAGGCG TGATCAAGAT GGTCATGGCC 9000 

ATGCGCCACG GCGTCCTGCC CCGGACGCTG CACGTCGACC GGCCGTCGTC CTATGTGGAC 9060 

TG3TCGGCCG GTCCGGTCGA GCTGCTGACC GAGGCACGGG ACTGGGTGAG CAACGGCCAC 9120 

CCGCGCCGCG CGGGCGTGTC GTCGTTCGGC ATCGGCGGCA CCAACGCGCA CGTCGTCCTC 9180 

GAAGAGGTTG CCGCACCGAT CACCACGCCG CAGCCTGAGC CGGCCGAGTT CCTGGTGCCG 9240 
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GTGCTCGTCT CCGCGCGGAC GGCGGCGGGT CTGCGCGGCC AGGCCGGACG GCTCGCCGCG 9300 

TTCCTCGGCG ACCGGACCGA CGTCCGCGTC CCCGATGCCG CCTACGCACT GGCCACCACG 9360 

CGCGCCCAGC TCGACCACCG GGCCGTCGTC CTGGCCTCCG ACCGGGCACA GCTCTGCGCG 9420 

a».CCTTGCCG CGTTCGGCTC CGGCGTCGTG ACCGGAACGC CGGTTGACGG CAAGCTGGCC 9480 

GTGCTCTTCA CCGGCCAGGG CAGCCAGTGG GCCGGGATGG GCCGTGAACT CGCCGAGACG 9540 

TTCCCGGTCT TCCGCGACGC CTTCGAGGCC GCGTGCGAGG CCGTGGACAC GCACCTGCGT 9600 

r% GAGCGTCCGC TGCGCGAGGT CGTGTTCGAC GACAGCGCGC TGCTCGACCA GACGATGTAC 9660 

1 

Tn ACCCAGGGCG CCCTGTTCGC CGTGGAGACC GCGTTGTTCC GGCTCTTCGA GTCCTGGGGT 9720 

I ij 

!ij GTGCGGCCGG GTCTCCTCGC CGGTCACTCG ATCGGCGAGC TCGCCGCCGC GCACGTGTCC 9780 

yi 

O; GGCGTGCTGG ACCTGGCCGA CGCGGGCGAG CTGGTCGCCG CGCGCGGCCG GCTGATGCAG 9840 

b- 

yj GCCCTGCCCG CGGGCGGCGC GATGGTCGCC GTCCAGGCGA CCGAGGACGA AGTCGCGCCC 9900 

CTGCTCGACG GCACGGTCTG CGTCGCCGCG GTCAACGGTC CGGACTCGGT GGTGCTCTCC 9960 

GGCACCGAAG CCGCCGTGCT CGCCGTCGCG GATGAACTGG CTGGTCGCGG CCGTAAGACC 10020 

CGACGGCTGG CCGTGAGCCA CGCCTTCCAC TCGCCGCTCA OX^GAACCGAT GCTCGACGAC 10080 

TTCCGCGCGG TCGCCGAACG CCTGACGTAC CGGGCCGGTT CGCTGCCCGT CGTCTCGACG 10140 

CTGACCGGGG AACTCGCGGC GCTCGACAGC CCGGACTACT GGGTCGGCCA GGTGCGCAAC 10200 

GCCGTGCGGT TCAGCGACGC CGTCACCGCG CTGGGCGCCC AAGGCGCGTC GACGTTCCTC 10260 

GAGCTCGGCC CGGGCGGTGC GCTCGCCGCG ATGGCGCTCG GCACGCTCGG CGGACCCGAG 10320 

CAGAGCTGCG TCGCGACCCT GCGCAAGAAC GGCGCCGAGG TGCCCGACGT CCTCACCGCG 10380 
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CTCGCCGAAC TGCACGTCCG GGGCGTGGGC GTCGACTGGA CGACCGTGCT CGACGAACCG 10440 

GCCACGGCGG TCGGGACCGT CCTGCCCACC TACGCGTTCC AGCACCAGCG CTTCTGGGTC 10500 

GACGTCGACG AAACAGCGGC CGTCAGCGTC ACCCCGCCGC CGGCGGAGCC GATCGTGGAC 10560 

CGGCCGGTGC AGGACGTGCT GGAGCTGGTC CGGGAGAGCG CCGCGGTGGT GCTCGGGCAC 10620 

CGGGACGCCG GCAGTTTCGA CCTCGACCGG TCCTTCAAGG ACCACGGCTT CGACTCGCTC 10680 

AGCGCGGTCA AGCTGCGCAA CCGTCTGCGC GACTTCACCG GCGTGGAGCT GCCO^GCACC 10740 

CJ CTGATCTTCG ACTACCCGAA CCCGGCCGTC CTCGCGGACC ACCTGCGGGC CGAACTGCTC 10800 

m 

CI GGCa^VGCGCC CGGCCGCGCC GGCCCCGGTG ACGAGGGACG TCTCCGACGA GCCGATCGCG 10860 

CI- 
U1 

. ATCGTCGGCA TGAGCACCCG GCTGCCGGGT GGCGCCGACA GCCCCGAAGA GCTGTGGAAG 10920 

□ 

£1. CTCGTCGCGG AGGGACGGGA CGCCGTGTCC GGCTTCCCCG TCGACCGCGG CTGGGftCCTC 10980 

SI GACGGCCTCT ACCACCCGGA CCCCGCCCAC GCCGGGACGA GCTACACGCG TTCGGGCGGC 11040 

TTCCTGCACG ACGCGGCCCA GTTCGACGCC GGGCTCTTCG GGATCTCACC GCGTGAGGCC 11100 

CTGGCCATGG ACCCGCAGCA GCGGCTGCTG CTGGAGACGT CGTGGGAAGC CTTGGAGCGC 11160 

GCGGGGGTCG ACCCGCTGTC CGCCCGCGGC AGCGACGTCG GCGTCTTCAC CGGGATCGTC 11220 

CACCACGACT ACGTGACGCG GCTGCGCGAA GTGCCCGAAG ACGTCCAGGG CTACACGATG 11280 

ACCGGCACGG CTTCGAGCGT GGCGTCGGGC CGGGTGGCGT ACGTCTTCGG CTTCGAGGGC 11340 

CCGGCGGTCA CCGTGGACAC CGCGTGTTCG TCGTCGCTGG TCGCGATGCA CCTGGCGGCG 11400 

CAGGCGCTGC GGCAGGGGGA GTGCTCGATG GCCCTGGCCG GCGGCGCGAC CGTGATGGCC 11460 
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AGCCCGGACG CCTTCCTCGA GTTCTCCCGC CAGCGCGGCC TGTCCGCGGA CGGCCGGTGC 11520 

AAGGCGTACG CGGAAGGCGC GGACGGCACG GGCTGGQCCG AGGGCGTCGG TGTCGTCGTC 11580 

CTCGAACGGC TTTCGGTCGC ACGCGAACGT GGCCACCGGG TGCTGGCGGT CCTGCGCGGC 11640 

AGCGCGGTGA ACCAGGACGG TGCTTCCAAC GGCCTGACCG CCCCGAACGG GCCGTCGCAG 11700 

CAGCGGGTGA TCCGCGGCGC GCTGGCGAGC GCCGGGCTGG CACCGTCCGA TGTGGACGTC 11760 

GTGGAGGGCX: ACGGGACCGG GACCGCGCTG GGTGACCCGA TCGAGGTCCA GGCGCTGCTG 11820 

fj GCCACCTACG GGCAGGAGCG GGAACAGCCG TTGTGGCTCG GCTCGCTGAA GTCGAACCTC 11880 

jl GGGCACACGC AGGCCGCGGC CGGGGTCGTG GGCGTGATCA AGATGATCAT GGCCATGCGC 11940 

CACGGCGTCA TGCCGGCCAC GCTCCACGTC GACGAGCGCA CGAGCCAGGT CGACTGGTCG 12000 

m 

p: GCGGGCGCGA TCGAGGTGTT GACCGAGGCC CGGGAGTGGC CGCGCACCGG ACGTCCGCGC 12060 

yS CGGGCCGGGG TGTCCTCCTT CGGCGCCAGC GGCACCAACG CGCACCTGAT CATCGAGGAA 12120 

w. 

K 3 

GGTCCCGCCG AAGAGGCCGT GGACGAAGAG GTGGCCTCCG TGGTGCCGCT GGTCGTCTCC 12180 

GCCCGCAGCG CCGGTTCGCT GGCCGGGCAG GCCGGGCGCC TGGCCGCGGT CCTCGAGAAC 12240 

GAATCGTTGG CCGGGGTGGC CGGTGCCCTG GTTTCCGGCC GCGCGACGCT GAACGAGCGC 12300 

GCGGTCGTCA TCGCGGGCTC CCGCGACGAG GCCCAGGACG GCCTGCAGGC ACTGGCCCGC 12360 

GGCGAGAACG CGCCCGGCGT CGTGACCGGG ACGGCGGGCA AGCCGGGCAA GGTCGTCTGG 12420 

GTCTTCCCCG GCCAGGGCTC GCAGTGGATG GGCATGGGCC GGGACCTCCT GGACTCCTCG 12480 

CCGGTGTTCG CCGCGCGGAT CAAGGAATGC GCTGCGGCAC TGGAACAGTG GACCGACTGG 12540 

TCGC1GCTGG ACGTGCTGCG CGGCGACGCC GACCTGCTGG ACCGGGTCGA CGTGGTGCAG 12600 



-65- 



CCGGCCAGCT TCGCGATGAT GGTCGGGCTC GCCGCGGTGT GGACCTCGCT GGGGGTGACC 12660 

CCGGATGCGG TGCTCGGCCA CTCCCAGGGC GAGATCGCCG CGGCGTGCGT GTCCGGCGCG 12720 

CTGTCGCTCG ACGACGCGGC GAAGGTGGTC GCGTTGCGCA GCCAGGCGAT CGCGGGGGAG 12780 

CTGGCGGGCC GCGGCGGGAT GGCGTCGGTC GCACTGAGCG AAGAGGACGC AGTCGCGCGG 12840 

CTGACGCCGT GGGCGJ^ACCG GGTCGAGGTC GCCGCGGTCA ACAGCCCGTC CTCGGTCGTC 12900 

ATCGCGGGAG ACGCQCAGGC CCTCGACGAA GCCCTCGAAG CCCTGGCCGG CGACGGTGTC 12960 

C3 

0 CGGGTCCGGC GGGTCGCGGT GGACTACGCC TCCCACACCC GGCACGTCGA GGCGATCGCC 13020 

01 GA-AftCCCTCG CCAAGACCTT GGCCGGGATC GACGCGCGGG TTCCGC-CGAT TCCGTTCTAT 13080 

« 3- 

= :« 

^" TCCACCGTCC TGGGCACGTG GATCGAGCAG GCCGTCGTCG ACGCGGGCTA CTGGTACCGG 13140 

P- 

i"^CCTGCGGC AGCAGGTGCG GTTCGGCCCC TCGGTGGCGG ACCTGGCCGG GCTGGGGCAC 13200 

£3 

P ACGGTGTTCG TGGAGATCAG CGCCCACCCG GTGCTGGTCC AGCCGCTGAG CGAGATCAGC 13260 

GACGACGCGG TCGTCACCGG GTCGCTGCGG CGGaJVCGACG GGGGACTGCG GCGCCTGCTG 13320 

GCGTCGGCGG CCGAACTGTA CGTCCGGGGC GTGGCCGTGG ACTGGACGGC GGCCGTGCCC 13380 

GCGGCCGGCT GGGTGGACCT GCCGACGTAC GCCTTCGACC GCCGCCACTT CTGGCTGCAC 13440 

GAAGCCGAGA CCGCCGAAGC CGCCGAGGGC ATGGACGGCG AGTTCTGGAC GGCGATCGAA 13500 

CAGTCCGATG TGGACAGCTT GGCCGAGCIG CTCGAGCTGG TGCCGGAGCA GCGCGGGGCG 13560 

CTCAGCACCG TCGTGCCCGT GCTGGCGCAG TGGCGGGACC GGCGCCGCGA GCGCTCGACC 13620 

GCGGAGAAGC TGCGCTACCA GGTCACCTGG CAGCCCCTGG AGCGCGAAGC CGCCGGCGTG 13680 
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CCGGGCGGGC GCTGGCTGGC CGTCGTCCX:G GCCGGCACCA CCGACGCGCT CCTGAAGGAG 13740 

CTGACCGGCC AGGGACTCGA CATCGTCCGG CTGGAGATCG AGGAAGCTTC GCGGGCACAG 13800 

CTCGCCGRGC AGCTGCGGAA CGTCCTGGCG GAGCACGACC TCACCGGCGT GCTGTCGCTG 13860 

CTCQCTCTCG ACGGCGGCCC CGCGGACGCG GCCGAGATCA CCGCGTCGAC GCTCGCGCTG 13920 

GTCCAGGCX:C TGGGCGACAC CACCACGTCC GCGCCGCTGT GGTGCCTCAC TTCCGGCGCG 13980 

GTGAACATCG GCATCCAGGA CGCCGTCACC GCACCGGCCC AGGCGGCCGT GTGGGGGCTC 14040 

GG^CGGGCCG TCGCGCTGGA GCGCCTCGAC CGGTGGGGCG GCCTGGTCGA CTTGCCCGCC 14100 

P- 
1= 

QCGATCGACG CCCGCACGGC TCAGGCCCTG CTCGGCGTCC TGAACGGCGC CGCCGGGGAA 14160 

21- 

m- ' 

m GACCAGCTCG CGGTCCGGCG COXiGGGCGTC TACCGCAQGC GGCTGGTCCG CAAGCCCGTG 14220 

n 

hi CCGGAGTCCG CGACGAGCCG GIGGGAACCC CGCGGCACGG TCCTGGTGAC CGGTGGGGCG 14280 

b. 

1% GAAGGACTCG GCCGGCACGC CTCGGTCTCG CTCGCGCAGT CCGGCGCCGA ACGGCTCATC 14340 

K 

GTCACCGGCA CCGACGGCGT CGACGAACTG ACGGCCGAGC OXSGCCGAGTT CGGCACCACG 14400 

GTCGRGTTCT GCGCCGACAC CGACCGGGAC GCGATCGCGC AGCTGGTGGC GGACTCGGAG 14460 

• GTCACCGCCG 1GGTGCACGC CGCGGACATC GCGO^GACCA GCTCCGTCGA CGACACCGGC 14520 

GTCGCCGACC TCGACGAGGT GTTCGCCGCG AAGGTGACCA CCGCGGTGTG GCTGGACCAG 14580 

CTCTTCGAGG ACACCCCGCT CGACGCGTTC GTCGTGTTCT CCTCGATCGC CGGCATCTGG 14640 
GGCGGTGGCG GGCAGGGCCC GGCGGGTGCG GCGAACGCCG TCCTCGACGC CCTGGTCGAA 14700 
TGGCGCCGGG CCCGCGGCCT CAAGGCGACG TCGATCGCCT GGGGCGCGCT CGACCAGATC 14760 
GGCATCGGCA TGGACGAGGC CGCCCTCGCC CAGCTGCGCC GCCGCGGTGT CATCCCGATG 14820 
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GCC3CCGCCGC TCGCGGTCAC CGCGATGGTG CAQSCGGTCG CCGGCAACGA GAAGGCCGTG 14880 

GCGGTCGCCG ACATOGACTG GGCCGCCTTC ATCCCGGCGT TCACCTCGGT CCGGCCCAGC 14940 

CCGCTGTTCG CCGATCTGCC CGAGGCGAAG GCCATCCTCC GGGCGGCGCA GGACGACGGC 15000 

GAAGACGGCG ACACCGCGTC GTCGCTCGCG GACTCCCTGC GCGCGGTCCC CGACGCCGAG 15060 

CAGAACCGCA TCCTGCTGAA GCTGGTCCGC GGCCACGCTT CGACGGTGCT CGGCCftCAGC 15120 

GGCGCCGAAG GCATCGGCCC GCGCCAGGCG TTCCAGGftGG TCGGCTTCGA CTCGCTGGCC 15180 

^. GCGGTCAACC TCCGCAACAG CCTGCACGCG GCCACCGGGC TGCGGCTGCC CGCGACGCTG 15240 

i 

ATCTTCGACT ACCXICACCCC GGAGGCGCTG GTCGGCTACC TGCGCGTCGA ACTCCTGCGG 15300 

GAGGCCGACG ACGGCCTGGA CGGGCGGGAA GACGACCTCC GGCGAGTCCT CGCGGCCGTG 15360 

CCGTTCGCCC GGTTCAAGGA GGCGGGCGTG CTGGACACGC TGCTCGGCCT CGCCGACACC 15420 

m GGCACCGAAC CGGGCACGGA CGCCX3AGACC ACCGAAGCGG CCCCGGCCGC CGACGACGCA 15480 

GAACTGATCG ACGCACTCGA CATCTCCGGT CTCGTGCAAC GAGCCCTCGG GCAGACGAGC 15540 

TCACCGCCGA TGGCGAACCA ATCGTGGAGG AAGAACATGT CCGCGCCGAA CGAGCAGATC 15600 

GTTGACGCAC TGCGCGCGTC QCTGAAGGAG AACGTCCGGC TTCAGCAGGA GAACAGCGCG 15660 

CTCGCCGCGG CCGCCGCGGA GCCCGTCGCG ATCGTCTCCA TGGCCTGCCG CTACGCGGGC 15720 
GGGATCCGCG GCCCGGAGGA CTTCTGGCGG GTGGTGTCGG AAGGCGCCGA CGTCTACACC 15780 
GGCTTCCCCG AGaACCGCGG CTGGGACGTC GAAGGCCTCT ACCACCCGGA CCCCGACAAC 15840 
CCCGGCACGA CGTACGTGCG GGAGGGCGCC TTCCTGCAGG ACGCGGCCCA GTTCGACGCC 15900 



01. 
CI. 
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GGGTTCTTCG GCATCTCGCC GCGCGAGGCG CTGGCCATGG ACCCCCAGCA GCGGCAGCTC 15960 

CTGGAGGTCT CCTGGGAGAC CTTGGAACGG GCCGGCATCG ACCCGCATTC GGTGCGGGGC 16020 

AGCGACATCG GCGTCTACGC CGGGGTCGTG CACCAGGACT ACGCCCCCGA CCTCAGCGGG 16080 

TTCGAAGGCT TCATGAGCCT GGAGCGCGCC CTGGGCACCG CGGGCGGTGT CGCCTCCGGC 16140 

CGGGTCGCCT ACACGCTCGG GCTCGAAGGC CCCGCTGTCA CCGTCGACAC GATGTGCTCG 16200 

TCGTCGCTCG TGGCGATTCA CCTTGCCGCG CAAGCTCTTC GCCGTGGTGA GTGCTCGATG 16260 

GCCCTCGCGG GCGGCTCGAC CGTGATGGCG ACXICCGGQCG GGTTCGTCGG CTTCGCGCGT 16320 

CAfiCGGGCGT TCGCCTTCGA CQGGCGCTGC AAGTCCTACG CTGCGGCCGC CGACGGTTCC 16380 

GGCTGGQCCG AGGGCGTCGG CGTGCTGCTG CTGGAGCGGC TGTCGGTGGC GCGCGAGCGC 16440 

GGGCACCAGG TGCTGGCCGT CATCCGCGGC AGCGCGGTCA ACCAGGACGG CGCTTCCAAC 16500 

GGCCTGftCCG CGCCCAACGG CCCGGCGCAG CAGCGGGTCA TCCGCAAGGC ACTGGCGAGC 16560 

GCCGGGCTGA CACCGTCCGA TGTGGACACC GTGGAGGGCC ACGGai^CGG CACCGTCCTC 16620 

GGCGACCCGA TCGAGGTCCA GGCGCTGCTG GCCACCTACG GCCAGGGCCG CGACCCGCAG 16680 

CAACCGCTGT GGCTGGGCTC GGTCAAGTCC GTCGTCGGGC ACACGCAGGC GGCATCCGGT 16740 

GTCGCCGGCG TGATCAAGAT GGTCCAGTCG CTGCGGCACG GGCAGCTCCC GGCGACCCAG 16800 

CACGTCGACG CGCCCACGCC GCAAGTGGAC TGGTCGGCCG GAGCGATCGA GCTGCTGGCC 16860 

GAGGGCCGGG AGTGGCCGCG CAACGGCCAC CCGCGCCGGG GCGGCATCTC GTCGTTCGGG 16920 

GCCAGCGGCA CGAACGCGCA CATGATCCTC GAAGAAGCGC CCGAGGACGA GCCGGTGACC 16980 

GAAGCGCCGG CGCCCACGGG TGTCGTACCG CTGGTGGTGT CGGCGGCGAC CGCTGCTTCC 17040 
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CTGGCCGCCC AGGCCGGTCG GCTGGCGGAG GTCGGCGACG TCTCCCTGGC GGATGTCGCC 17100 

GGGACGCTGG TGTCCGGCCG CGCGATGCTC AGCGAGCGCG CGGTCGTCGT GGCCGGCTCC 17160 

CACGAAGAAG CCGTGACCGG GCTGCGGGCG CTGGCCCGCG GCGAGAGCGC GCCCGGCCTG 17220 

CTTTCCGGCC GCGGCTCGGG GGTCCCGGGC AAGGTCGTCT GGGTGTTCCC CGGCCAGGGC 17280 

ACGCAGTGGG CCGGCATGGG CCGCGAGCTG CTGGACTCCT CGGAGGTGTT CGCCGCGCGG 17340 

ATCGCCGAGT GCGAGAGCGC GCTCGGGCGG TGGGTCGACT GGTCGCTGAC CGACGTGCTG 17400 

b 

□ CGCGGCGAGG CCGACCTGCT GGACCGGGTC GACGTGGTGC AACCGGCGAG CTTCGCCGTG 17460 

sits 

fli 

ATGGTCGGGC TTGCCGCCGT CTGGGCCTCC CTCGGCGTCG AGCCCGAGGC CGTGGTGGGC 17520 

y = - 
yi. • 

CACTCGCAGG GCGAGATCGC GGCCGCATGC GTGTCCGGGG CACTGTCCCT GGAGGACGCG 17580 

'"^ GCGAAGGTGG TGGCGTTGCG CAGCCAGGCG ATCGCCGCCT CGCTGGCCGG CCGGGGCGGC 17640 

£3 ATGGCGTCGG TCGCGTTGAG CGAAGAAGAC GCGACCGCGC GGCTCGAGCC GTGGGCGGGC 17700 

CGCGTGGAGG TCGCCGCCGT CAACGGGCCG ACGTCCGTGG TGATCGCCGG GGACGCCGAG 17760 

GCGCTGGACG AAGCCCTCGA CGCGCTCGAC GACCAAGGCG TCCGGATCCG GCGGGTGGCG 17820 

GTGGACTACG CCTCCCACAC CCGGCACGTC GAAGCCGCGC GCGACGCACT GGCCGAGATG 17880 

CTGGGCGGGA TCCGCGCGCA GGCGCCGGAA GTGCCGTTCT ACTCGACCGT GACCGGCGGC 17940 

TGGGTCGAAG ACGCCGGCGT GCTCGACGGC GGCTACTGGT ACCGGAACCT CCGCCGTCAG 18000 

GTGCGGTTCG GCCCGGCGGT GGCCGAGCTG ATCGAGCAGG GCCACCGGGT GTTCGTCGAG 18060 

GTCAGCGCGC ATCCCGTGCT GGTTCAGCCG ATCAACGAAC TCGTCGACGA CACCGAAGCC 18120 
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GTCGTCACCG GGACGCTGCG GCGCGAGGAC GGCGGCCTCC GGCGCCTGCT GGCCTCGGCG 18180 

GCCGAGCTCT TCGTCCGCGG CGTGACCGTG GACTGGTCCG GTGTGCTGCC ACCGTCCCGC 18240 

CGGGTCGAGC TGCCGACGTA CGCCTTCGAC CACCAGCACT ACTGGCTGCA GATGGGCGGG 18300 

TCGGCCACCG ACGCCGTGTC GCTGGGCCTG GCCGGCGCCG ACCACCCGCT GCTGGGCGCG 18360 

GTCGTCCCGC TGCCGCAGTC CGACGGGCTC GTCTTCACCT CGCGGCTGTC GCTaAAGTCG 18420 

CACCCGTCGC TGGCCGGGCA CGCGATCGGC GGGGTCGTGC TCATTCCGGG CACGGTGTAC 18480 

|,i GTCGACCTCG CGCTGCGCGC CGGCGACG.AG CTCGGCTTCG GCGTCCTGGA AGAGCTCGTG 18540 

u 

%' ATCGAGGCAC CGCTGGTGCT GGGCGAGCGC GGCGGCGTTC GCGTGCAGGT CGCCGTGAGC 18600 

via 

nJ 

GGGCCa^UVCG AGACCGGCTC GCGTGCGGTG GACGTCTTCT CCATGCGGGA AGACGGCGAC 18660 

= GA'VTGa'VCCC GGCACGCGAC CGGTCTCCTC GGGGCGTCGA CGTCCCGGGA ACCGAGCCGC 18720 

£3 TTCGACTTCG CCGCCTGGCC GCCGGCCGGG GCGGAGCCGA TCGACGTCGA AAACTTCTAC 18780 

'41' 

!?=:• 

RJ ACCGACCTCA CCGAGCGCGG GTACGCCTAC AGCGGCGCCT TCCAGGGCAT GCGGGCGGTC 18840 

TGGCGGCGCG GTGACGAGGT CTTCGCCGAG GTCGCGCTGC CTGACGACCA CCGCGAGGAC 18900 

GCCGGCAAGT TCGGCCTCCA CCCCGCCCTC CTCGACGCCG CTCTGCACAC GAACGCCTTC 18960 

GCGAACCCGG ACGACGACCG CAGTGTGCTG CCGTTCGCGT GGAACGGCCT GGTCCTGCAC 19020 

GCCGTGGGCG CGTCGGCGCT GCGGGTGCGG GTGGCGCCGG GCGGTCCGGA CGCGCTa^^-CG 19080 

TTCCAGGCCG CCGACGAGAC CGGTGGCCTG GTCGTCACCA TGGATTCGCT GGTGTCCCGC 19140 

GAGGTGTCGG CCGCGCAGCT GGAGACGGCG GCGGGCGAAG AGCGCGACTC GCTGTTCCAG 19200 

GTCGACTGGA TCGAGGTCCC CGCGACCGAG ACCGCGGCCA CCGAGCACGC CGAGGTGCTC 19260 
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GAAGCCTTCG GCGAGGCAGC GCCCCTCGAG CTGACCAGCC GGGTGCTGGA GGCCGTGCAG 
TCCTCGCTCG CCGACGCGGC CGACGAAGCA CGGTTGGTCG TGGTGACCCG TGGCGCCGTG 
CGCGAGGTGA CGGACCCGGC CGGTGCCGCC GTGTGGGGTT TGGTGCGAGC CGCCCAGGCG 
GRGAftCCCGG GCCGGATCAT CCTCGTCGAC ACCGACGQCG ACGTCCCGCT GGGTGCGGTG 
CTGGCCAGTG GTGAGCCGCA GCTCGCCGTG CGCGGCAACG CTTTCTCCGT CCCGCGCCTC 
GCCCGGGCCA CCGGCGAQGT GCCGGAGGCC CCCGCGGTGT TCAGTCCGGA AGGGACGGTC 
CTGCTCftCCG GCGGCACCGG CTCGCTGGGC GGTCTGGTGG CCAAGCACCT GGTTGCCCGG 
CACGGCGTCC GGCGGCTGGT GCTCGCCAGC CGCCGAGGCG TGGCCQCGGA AGACCTCGTC 
ACCGRGCTGA CCGAGCAGGG CGCGACGGTG TCCGTGGTGG CTTGCGACGT CTCCGACCGC 
GACCAGGTGG CCGCGTTGCT GGCCGAACAC CGCCCGACCG GCATCGTGCA CCTGGCCGGC 
CTGCTGGACG ACGGCGTCAT CGGAGCCCTG ARCCGGGAGC GGCTGGCCGG GGTGTTCGCG 
CCCAfiGGTCG ATGCCGTCCA GCACCTCGAC GAACTGACCC GCGACCTCGG CCTCGACGCG 
TTCGTCGTCT TCTCGTCCGC AGCCGCGCTC ATGGGCTCCG CCGGCCAGGG CAACTACGCG 
GCCGCCAACG CCTTCCTCGA . CGGCTTGATG GCCGGGCGCC GCGCGGCGGG CCTGCCAGGC 
GTGTCCCTGG CGTGGGGCCT GTGGGAGCAG GCGGACGGCC TGACCGCGAA CCTCAGCGCC 
ACCGACCAGG CCCGGATGAG GCGCGGCGGG GTGCTGCCGA TGACACCGGC CGAGGCCCTG 
GACATCTTCG ACATCGGCCT GGCCGCCGAG CAGGCCCTGC TGGTCCCGAT CAAGCTCGAC 
CTGCGGACGC TGCGCGGCCA GGCCACCGCC GGCGGCGAGG TGCCGCACCT GCTGCGCGGC 
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CTGGTCCGCG CGAGCCGCCG CGTGACCCGC ACGGCTGCCG CGAGTGGCGG CGGTGGCCTG 20400 

GTCCACAAGC TCGCCGGGCG GCCAGCCGAA GAGCAGGAAG CCGTGCTGCT GGGCATCGTC 20460 

CAGGCGGAGG CGGCCGCGGT GCTCGGCTTC AACGCCCCCG AGCTGGCCCA GGGCACCCGC 20520 

GGGTTCAGCG ACCTCGGCTT CGACTCGCTG ACCGCGGTCG AGCTGCGGAA CCGGCTGAGC 20580 

GCGGCGACCG GCGTCAAATT GCCCGCCACG CTCGTCTTCG ACTACCCGAC GCCGGTCGCG 20640 

CTCGCCCGCC ACCTGCGCGA AGAGCTGGGC GAGACGGTGG CGGGTGCGCC GGCCACGCCG 20700 

GTGACGACCG TCGCCGACGC GGGCGAGCCG ATCGCCATCG TCGGCATGGC GTGCCGCCTG 20760 

=1^ CCGGGCGGCG TGATGAGCCC CGACGACCTC TGGCGGATGG TCGCCGAGGG CCGCGATGGG 20820 

Ql 

AT ATGTCGCCGT TCCCCGGAGA CCGCGGCTGG GACCTGGACG GCCTGTTCGA CTCGGACCCC 20880 

til 

£3 GAGCGCCCGG GCACCGCCTA CATCCGCCAA GGCGGCTTCC TGCACGAGGC CGCGCTGTTC 20940 

GACCCGGGCT TCTTCGGGAT CTCGCCGCGC GAAGCCCTGG CCATGGACCC GCAGCAGCGG 21000 

f '"I" 

CTGCTGCTCG AAGCCTCCTG GGAAGCCCTG GAGCGCGCGG GCATCGACCC GACCAAGGCC 21060 
CGCGGTGACG CCGTCGGCGT CTTCTCCGGC GTCTCCATCC ACGACTACCT CGAGTCCCTG 21120 
AGCAACATGC CCGCCGAGCT CGAAGGCTTC GTCACCACGG CCACGGCGGG CAGCGTCGCC 21180 
TCGGGCCGGG TGTCCTACAC CTTCGGGTTC GAGGGCCCGG CGGTCACGGT GGACACGGCG 21240 
TGCTCGTCGT CGCTGGTCGC GATCCACCTG GCCGCACAGG CACTGCGGCA GGGCGAGTGC 21300 
'ACGATGGCCC TGGCCGGCGG TGTCGCCGTG ATGGGCTCGC CGATCGGTGT CATCGGCATG 21360 
TCGCGGCAGC GCGGCATGGC CGAGGACGGC CGGGTCAAGG CGTTCGCCGA CGGCGCGGAC 21420 
GGCACCGTCC TGTCCGAAGG CGTCGGCATC GTCGTCCTCG AACGGCTTTC GGTGGCCCGC 21480 
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GAACGCGGGC ACCGGGTGCT CGCCGTCCTC CGCGGCAGCG CGGTCAACCA GGACGGCGCT 21540 

TCGAACGGCC TGACCGCGCC CAACGGGCCG TCGCAGCAGC GGGTGATCCX5 CAGCGCGCTG 21600 

GCCGGGGCCG GACTGCAACC GTCCGAAGTG GACGTCGTCG AAGCGCACGG CACCGGGACC 21660 

GCGCTGGGCG AACCGATCGA AGCCCAGGCC CTGCTGGCCA CCTACGGCAA GAGCCGCGAG 21720 

ACGCCGTTGT GGCTCGGGTC GCTGAAGTCG AACATCGGCC ACACCCAGGC GGCCGCGGGC 21780 

GTGGCGGCCG TGATCAAGAT GGTCCAGGCG CTGCGGCAGG ACACCCTGCC GCCGACCCTC 21840 

M 

it- CACGTGCAGG AACCCACCAA GCAGGTGGAC TGGTCCGCGG GTGCGGTCGA GCTGCTGACC 21900 

=S3S? 

^ GAAGGCCGGG AGTGGGCCCG CAACGGCCAC CCGCGCCGGG CCGGTGTCTC GTCGTTCGGC 21960 

C]. 

^"^ ATCAGCGGCA CCAACGCGCA CCTCATCCTG GAAGAGGCGC CCGCCGACGA CACCGCCGAG 22020 

^= GCGGACGTGC CCGACGCCGT GGTCCCCGTG GTGATCTCCG CGCGCAGCAC CGGATCCCTG 22080 

C5. GCGGGCCAGG CCGGACGCCT GGCGGCGTTC CTCGACGGAG ACGTCCCGCT GACCCGCGTG 22140 

GCGGGTGCCC TGCTGTCGAC CCGGGCGACG CTGACCGACC GGGCCGTCGT CGTGGCGGGC 22200 

TCGGCCGAGG AGGCCCGGGC GGGGCTGACC GCGCTGGCCC GCGGCGAGAG CGCGAGCGGG 22260 

CTTGTGACCG GTACCGCAGG GATGCCGGGC AAGACGGTCT GGGTGTTCCC CGGCCAGGGG 22320 

ACGCAGTGGG CGGGCATGGG CCGGGAGCTC CTCGAAGCGT CCCCGGTGTT CGCCGAGCGC 22380 

ATTGAGGAAT GCGCGGCCGC GCTGCAGCCG TGGATCGACT GGTCGCTGCT GGACGTCCTC 22440 

CGTGGCGAAG GTGAGCTGGA TCGGGTCGAC GTGCTGCAGC CGGCGTGTTT CGCGGTGATG 22500 

GTGGGGCTGG CCGCCGTCTG GGCCTCGGTC GGCGTCGTGC CGGACGCGGT CCTGGGCCAC 22560 



3 5rf 
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TCCCAGGGCG AGATTGCCGC CGCCTGCGTG TCGGGTGCAC TGTCCCTCGA GGACGCAGCC 
AAGGTCGTCG CGCTGCGCAG CCAGGCGATC GCGGCGGAGC TGTCGGGCCG CGGGGGCATG 
GCGTCGATCC AGCTGAGCCA CGACGAGGTG GCTGCCCGGC TCGCGCCGTG GGCGGGCCGC 
GTCGAGATCG CCGCCGTCAA CGGTCCGGCC TCGGTCGTGA TCGCCGGTGA CGCCGAAGCG 
CTCACCGAGG CCGTCGAAGT CCTCGGCGGT CGGCGGGTGG CGGTGGACTA CGCGTCCCAC 
ACGCGGCACG TCGAGGACAT CCAGGACACC CTCGCCGAGA CTCTGGCCGG a^^TCGACGCG 
CAGGCCCCCG TGGTGCCCTT CTACTCCACG GTCGCCGGCG AGTGGATCAC CGATGCCGGG 
GTCGTCGACG GCGGGTACTG GTACCGGAAC CTGCGCAACC AGGTCGGCTT CGGCCCGGCC 
GTGGCCGAGC TGATCGAGCA GGGGCACGGG GTGTTCGTCG AGGTCAGTGC GCATCCGGTG 
CTGGTGC^^GC Ca^^TCAGCGA GCTCACCGAT GCGGTCGTCA CCGGGACGTT GCGGCGCGAC 
GACGGTGGGG TGCGGCGGCT GCTGACCTCG ATGGCCGAAC TGTTCGTCCG CGGTGTCCCG 
GTCGACTGGG CCACGATGGC GCCGCCCGCG CGCGTCGAGC TGCCGACCTA CGCCTTCGAC 
CACCAGCACT TCTGGCTCAG CCCGCCCGCC GTGGCGGACG CGCCCGCGCT CGGCCTGGCC 
GGCGCCGACC A.CCCGCTGCT GGGGGCGGTT CTCCCGCTGC CGCAGTCCGA CGGCCTGGTG 
TTCACCTCGC GCCTGTCGGT GCGGACGCAT CCGTGGCTGG CCGACGGCGT CCCCGCCGCC 
GCCTTGGTGG AGCTGGCCGT GCGGGCCGGT GACGAAGCCG GTTGCCCGGT CCTCGCCGAC 
CTGACCGTCG AAAAGCTGCT GGTGCTGCCG GAGAGCGGTG GCCTGCGCGT CCAGGTGATC 
GTGAGCGGCG AGCGCACGGT CGAGGTGTAT TCGCAGCTCG AAGGCGCCGA AGACTGGATC 
CGGAACGCCA CCGGGCACCT GTCCGCCACG GCTCCGGCGC ACGAGGCCTT CGACTTCACC 
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GCCTGGCCGC CCGCCGGAGC CCAGCAGGTC GACGGCCTCT GGCGC3CGCGG CGACGAGATC 23760 

TTCGCCGAGG TCGCCCTGCC GGAGGAGCTG GACGCCGGCG CGTTCGGCAT CCACCCCTTC 23820 

CTGCTGGACG CGGCCGTCCA GCCGGTCCTC GCGGACGACG AGCAGCCGGC GGAGTGGCGC 23880 

AGCCTGGTCC TGCACGCCGC GGGTGCCTCG GCGCTGCGCG TGCGGCTGGT GCCCGGCGGT 23940 

GCCCTCCAAG CGGCGGACGA AACCGGCGGG CTGGTCCTCA CGGCGGATTC GGTGGCAGGC 24000 

CGGGAACTCT CGGCCGGGAA GACCCGCGCC GGATCGCTGT ACCGGGTCGA CTGGACCGAA 24060 

GTGTCCATTG CAGACAGTGC GG-TCCCGGCC AACATCGAGG TCGTCGAAGC CTTCGGTGAA 24120 

GAGCCCCTCG AACTGACCGG CCGGGTCCTG GAGGCTGTGC AGACCTGGCT CGTCACCGCG 24180 

GCCGACGATG CGCGGCTCGT CGTCGTCACC CGCGGCGCCG TGCGCGAGGT GACCGACCCC 24240 

GCCGGTGCGG CCGTCTGGGG CCTGGTCCGA GCCGCGCAGG CGGAGAACCC CGGTCGCATC 24300 

TTCCTGATCG ACACCGACGG CGAGATCCCG GCCCTGACCG GTGACGAGCC CGAGATCGCG 24360 

GTCCGCGGCG GGAAGTTCTT CGTGCCCCGC ATCJ^CTCGCG CGGAGCCGAG CGGGGCCGCC 24420 

GTGTTCCGCC CGGACGGGAC AGTGCTGATC TCGGGCGCGG GTGCGCTCGG TGGCCTGGTG 24480 

GCCCGGCGTC TCGTCGAACG CCACGGCGTG CGGAAGCTCG TGCTGGCGTC CCGGCGCGGC 24540 

CGAGACGCCG ACGGCGTCGC GGACCTGGTC GCCGACCTGG CCGCGGACGT GTCCGTGGTG 24600 

GCTTCCGACG TCTCCGATCG CGCCCAGGTG GCGGCCCTGC TCGACGAGCA CCGGCCGACC 24660 

GCCGTCGTCC ACACCGCCGG CGTCATCGAC GCGGGCGTGA TCGAGACGCT GGACCGGGAC 24720 
CGGCTGGCCA CGGTGTTCGC GCCGAAGGTC GACGCCGTGC GGCACCTCGA CGAGCTGACC 24780 
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CGCGACCGCG ACCTCGACGC CTTCGTCGTC TACTCCTCGG TCTCGGCCGT GTTCATGGGC 24840 

GCGGGCAGCG GCAGTTACGC CGCGGCGAAC GCCTTCCTGG ACGGCCTGAT GGCGAACCGC 24900 

CGGGCGGCGG GCCTGCCGGG CCTCTCGCTC GCGTGGGGCC TGTGGGACCA GAGCACCGGT 24960 

ATCGCCGCCG GCACCGACGA GGCCACCCGG GCGCGGATGA GCCGCCGCGG TGGCCTGCAG 25020 

ATCATXli^GC AGGCCGAGGG CATGGACCTG TTCGACGCCG CGCTGTCGTC GGCCGAGTCG 25080 

CTGCTGGTGC CCGCCAAGCT CGACCTGCGT GGGGTGCGCG CCGACGCCGC CGCGGGCGGG 25140 

GTCGTCCCGC ACATGCTCCG TGGCCIGGTC CGCGCGGGCC GGGCGCAGGC CCGCGCGGCG 25200 

TCCACTGTGG ACAACGGGCT GGCCGGACGG CTGGCCGGGC TCGCCCCGGC GGACCAGCTC 25260 

ACGCTGCTCC TGGACCTGGT CCGGGCGCAG GTCGCGGCCG TGCTCGGGCA CGCCGACGCG 25320 

AGCGCCGTCC GCGTCGACAC GGCCTTCAAG GACGCCGGCT TCGACTCGCT GACCGCGGTC 25380 

GAGCTGCGCA ACCGCATGCG GACCGCCACC GGCCTGAAGC TGCCCGCGAC GCTCGTCTTC 25440 

GACTACCCGA ACCCCCAGGC GCTCGCCCGG CACCTGCGCXS ACGAACTCGG TGGTGCGGCC 25500 

CAGACGCCGG TGACCACAGC GGCCGCGAAG GCCGACCTCG ACGAGCCGAT CGCCATCGTC 25560 

GGGATGGCGT GCCGCTTGCC GGGCGGGGTC GCCGGGCCCG AGGACCTCTG GCGGCTGGTC 25620 
GCCGAGGGCC GGGACGCGGT GTCGAGCTTC CCGACCGACC GCGGCTGGGA CACCGACAGC 25680 
CTGTACGACC CCGATCCGGC CCGCCCGGGC AAGACCTACA CCCGGCACGG CGGCTTCCTG 25740 
CACGAAGCCG GGCTCTTCGA CGCGGGCTTC TTCGGGATCT CGCCACGCGA GGCCGTCGCC 25800 
ATGGACCCGC AGCAGCGGCT GCTGCTGGAG GCCTCCTGGG AGGCCATGGA AGACGCCGGG 25860 
GTCGACCCAC TTTCGCIGAA GGGCAACGftC GTCGGCGTGT TCACCGGCAT GTTCGGCCAG 25920 
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GGTTACGTCG CTCCCGGGGA CAGCGTCGTC ACGCCGGAGC TGGftGGGTTT CGCGGGCACG 25980 

GGCGGGTCGT CGAGTGTCGC GTCCGGCCGC GTGTCGTACG TGTTCGGGTT CGAAGGCCCG 26040 

GCCGIGACGA TCGACTCGGC GTGCTCGTCC TCGCTGGTCG CGATGCACCT CGCCGCGCAG 26100 

TCGCTGCGGC AGGGCGAGTC CTCGAOXSGCC TTGGCCGGCG GCGCGACGGT GATGGCGAAC 26160 

CCCGGCGCAT TCGTGGAGTT CTCGCGGCAG CGGGGCCTCG CCGTCGACGG TCGCTGCAAG 26220 

GCGTTCGCCG CCGCGGCCGA CGGCACCGGC TGGGCCGAGG GCGTCGGTGT GGTCATCCTC 26280 

GAGCGGCTGT CGGTGGCGCG GGAACGCGGC CACCGGATCC TGGCCGTGCT GCGCGGCAGC 26340 

GCGGTCAACC AGGACGGCGC CTCGAACGGC CTGACCGCGC CGAACGGGCC GTCGCAGCAG 26400 

CGGGTGATCC GCCGGGCGCT GGTGAGCGCC GGGCTGGCAC CGTCCGATGT GGACGTCGTC 26460 

GAGGCGCACG GCACCGGGAC CACGCTGGGT GACCCGATCG AGGGGCAAGC TCTGCTGGCT 26520 

ACCTACGGCA AGGACCGCGA GTCGCCGCTC TGGCTCGGCT CGCTGAAGTC GAACATCGGC 26580 

CACGCGCAGG CCGCCGCGGG GGTCGCCGGC GTCATCAAGA TGGTCCAGGC GCTCCGGCAC 26640 

GAAGTCCTCC CGCCGACGCT GCACGTCGAC CGGCCTACCC CCGAGGTCGA CTGGTCGGCC 26700 

GGTGCCGTCG AACTGCTGAC GGAAGCCCGC GAGTGGCCGC GCAACGGGCG CCCGCGCCGG 26760 

GCCGGGGTCT CCGCGTTCGG CGTCAGCGGC ACGAACGCGC ACCTaATCCT GGAGGAGGCG 26820 

CCCGCCGAAG AGCCGGTGCC CACACCCGAG GTTCCCCTGG TGCCGGTCGT GGTCTCCGCG 26880 

CGGAGCAGGG CGTCCCTGGC CGGTCAGGCC GGTCGCCOXZG CCGGATTCGT GGCGGGTGAC 26940 

GCGTCCTTGG CCGGTGTGGC CCGGGCGCTG GTGACGAACC GGGCCGCGCT GACCGAGCGC 27000 



LJ 
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GCGGTCATGG TCGTGGGCTC TCGCGAAGAA GCCGTGACGA ACCTGGAAGC GCTGGCCCGC 27060 

GGCGAAGACC CGGCCGCGGT GGTCACCGGC CGGGCGGGTT CGCCGGGCAA GCTCGTCTGG 27120 

GTCTTCCCCG GCCAGGGCTC GCAGTGGATC GGGATGGGCC GGGAACTCCT GGACTCTTCG 27180 

CCGGTCTTCG CCGAGCGGGT CGCCGAATGC GCGGCCGCCC TGGAACCGTG GATCGATTGG 27240 

TCACTGCTCG ACGTGCTGCG CGGGGAGTCC GACCTGCTGG ACCGGGTCGA CGTCGTGCAG 27300 

CCCGCCAGCT TCGCGATGAT GGTCGGCCTG GCCGCGCTGT GGCAGTCGGT GGGTGTCCGC 27360 

CCGGATGCCG TCGTCGGCCA COXJGCAGGGC GAGATCGCCG CCGCX:TGCGT CTCGGGCGCG 27420 



f, CTGTCGCTGC AGGACGCCGC GRAGGTGGTT GCCTTGCGCA GCCAGGCGAT CGCCACCCGG 27480 



fii 

s -a* 



S"^. CTGGCCGGGC GCGGCGGCAT GGCTTCCGTG GCGTTGAGCG AAGAAGACGC GACCGCGTGG 27540 

s 

O CTGGCGCCGT GGGCCGACCG GGTCCAGGTC GCCGCGGTCA ACAGCCCTGC CTCCGTGGTG 27600 

5 ATCGCCGGGG AAGCCCAGGC CCTCGACGAG GTCGTCGACX3 CGTTGTCCGG TCAQGAAGTC 27660 

^ ^ CGCGTCCGGC GGGTGGCCGT Gai^CTACGGG TCCCACACCA ACCAGGTCGA AGCCATCGAG 27720 

G&TCTGCTGG CCGAGACCTT GGCCGGCATC aAGGCGCAGG CCCCGAAGGT GCCCTTCTAC 27780 

TCGACCCTGA TCGGTGACTC GATCCGTGAC GCCGGGATCG TCGACGGCGG CTACTGGTAC 27840 

CGS-i^ACCTGC GCAACCAGGT CX3GGTTCGGT CCGGCCGTCG CGGAGCTCGT TCGCCAGGQC 27900 

CACGGGGTGT TCGTCGAGGT CAGCGCGCAC CCGGaXX:TGG TCCAGCCGCT CAGTGAACTC 27960 

AGCGACGACG CGGTGGTGAC CGGGTCGCTG CGGCGCGAAG ACGGTGGCCT GCGCCGCCTG 28020 

CTCACGTCGA TGGCCGAGCT GTACGTGCAG GGTGTCCCGC TCGACTGGAC CGCGGTCCTG 28080 

CCGCGGfiCCG GCCGGGTCGA CCTGCCGAAG TACGCCTTCG ACCACCGGCA CTACTGGCTG 28140 
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CGGCCCGCCG AGTCCGCGAC CGACGCGGCT TCGCTGGGCC AGGCGGCGGC CGACCACCTG 28200 

CTGCTGGGCG CGGTCGTCGA GCTGCCGCAG TCCGACGGCC TGGTGITCAC CTCGCGGCTG 28260 

TCCGTGCGGA CGCACCCGTG GCTGGCCGAC CACGCGGTCG GTGGCGTGGT CATCCTCCCC 28320 

GGCTCCGGGC TGGCCGAACT GGCCGTCCGG GCCGGCGACG AAGCCGGGTG CACCGCCCTC 28380 

GACGAGCTGA TCATCGAAGC TCCGCTGGTC GTGCCCGCCC AAGGCGCGGT <X;GCGTCCAG 28440 

GTCGCGTTGA GCGGCCCGGA CGAGACCGGC TCGCGCACGG TGGACCTCTA CTCCCAGCGC 28500 

GACGGCGGCG CGGGGACGTG GACGCGGCAC GCCACCGGCG TGCTGTCGAC GGCCCCCGCT 28560 

CAGGAACCCG AGTTCGACTT CCACGCCTGG CCGCCCGCGG ATGCCGAGCG GATCGACGTC 28620 

GAGACCTTCT ACACCGACCT GGCCGAGCGT GCTTACGGCT ACGGGCCGGC GTTCCAGGGG 28680 

CTGCAAGCGG TGTCGCGGCG TGACGGCGAC GTCTTCGCCG AGGTCGCCCT GCCCGAGGAC 28740 

CTGCGCAAGG ACGCGGGCCG GTTCGGCGTC CACCCGGCGC TGCTCGACGC GGCGCTGCAG 28800 

GCCGCCACGG CCGTGGGCGG CGACGAGCCC GCTCAGCCGG TGCTGGCGTT CGCGTGGAAC 28860 

GGCCTGGTCC TCCACGCCGC GGGCGCGTCG GCCCTGCGGG TCCGGCTCGC GCCGAGCGGC 28920 

CCGGACACGC TGTCCGTGGC AGCCGCCGAC GAAACCGGCG GCTTGGTCCT GACCATGGAA 28S80 

TCGCTCGTCT CCCGGCCGGT TTCGGCCGAG CAGCTCGGCG CCGCGGCCGA CGCGGGCCAC 29040 

GACGCGATGT TCCGCGTCGA CTGGACCGAG CTGCCTGCCG TGCCCCGCGC GGAACTGCCG 29100 

CCGTCGGTGC GGATCGACAC CGCCGACGAC GTCGCGGCCT TGGCGGAGAA GGCGGACGCA 29160 

CCACCGGTCG TCGTCTCGGA AGCCGCCGGG GGAGACCCGG CCCTGGCCGT GA(3TTCCCGG 29220 
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GTGCTCGAGA TCATGCAGGC CTGGCTGGCC GCGCCCGCGT TCGAGGAGGC CCGGCTGGTC 
GTGACGACCC GCGGCGCGGT ACCCGCCGGC GGTGACCACA CACTGACCGA CCCGGCCGCG 
GCCGCGGTGT GGGGCCTGGT CCGGTCCGCG CAGGCGGAAC ACCCGGACCG GGTCGTCCTG 
CTGGACACCG ACGGCGAAGT TCCGCTGGGC GCGGTGCTGG CCTCCGGTGA GCCGCAGCTC 
GCGGTGCGCG GAACGACGTT CTTCGTGCCC CGGCTGGCCC GCGCCACCCG GCTCTCGGAC 
GCGCCTCCTG CGTTCGACCC GGACGGGACC GTGCTGGTCT CGGGCGCCGG ATCGCTGGGC 
ACCTTGGTGG CCCGGCACCT GGTCACCCGG CACGGCGTGC GCCGGGTGGT GCTGGCCAGC 
CGGCAGGGCC GGGACGCCGA GGGCGCCCAG GACCTGATCA CCGAGCTCAC CGGCGAAGGC 
GCGGACGTCT CCTTCGTGGC CTGTGACGTC TCCGATCGCG ACCAGGTGGC CGCGCTGCTC 
GCGGGCCTCC CGGACCTGAC CGGQGTGGTG CACACCGCCG GCGTCTTCGA GGACGGCGTG 
ATCGAGGCGC TGACGCCCGA CCAGCTCGCG AACGTGTACG CGGCCAAGGT CACGGCCGCG 
ATGCACC1CG ACGAGCTCAC CCGCGACCGG GATCTCGGCG CGTTCGTCGT GTTCTCCTCC 
GTCGCGGGGG TGATGGGTGG TGGCGGTCAA GGCCCGTACG CGGCGGCGAA CGCCTTCCTG 
GACGCGGCGA TGGCGRGTCG TCAGGCCGCG GGCCTGCCGG GCCTGTCCCT GGCGTGGGGC 
CTCTGGGAfiC GCAGCAGCGG CATGGCX;GCC CACCTCAGCG AGGTCGACCA CGCGCGGGCG 
AGCCGCAACG GTGTCCTGGA ACTGACCCGG GCCGAGGGCC TGGCGCTGTT CGACCTCGGG 
CTGCGGATGG CCGi«3TCGCT GCTCGTGCCG ATCAAGCTCG ACCTCGCCGC GATGCGGGCG 
AGCACGGTCC CGGTCCTGTT CCGCGGCCTG GTCCGGCCGA GCCGGACCCA GGCGCGCACG 
GCGTCCACTC TGGACCGGGG GCTGGCCGGG CGGCTCGCCG GGCTGCCGGT GGCCGAGCGG 




GCGGCGGTGC TCGTCGACCT GGTCCGCGGG CAGGTCGCGG TCGTGCTCGG CTACGACGGG 30420 

CCGGAGGCCG TCCGCCCGGA CACGGCGTTC AAGGACACCG GGTTCGACTC GCTGACGTCG 30480 

GTGGAACTGC GCAACCGGCT GCGCGAGGCG ACCGGGCTCA AGCTCCCCGC CACGCTCGTC 30540 

ITCGACTACC CGAACCCCTT GGCGGTCGCG CGCTACCTGG GCGCGCGGCT GGTCCCGGAC 30600 

GGGACCGCGA ACGGCAACGG GAACGGGAAT GGGCACAGCG AAGACGACCG GCTGCGGCAC 30660 

GCGCTGGCGG CCATCGCGGC CGAGGACGCG GGCGAGGAGC GGTCGATCGC CGACCTGGGC 30720 

GTCGACGACC TCGTGCAACT GGCTTTCGGC GACGAGTGAT TGGGGCAAGT GGTGAGTGCG 30780 

TCGTATGAAA AGGTCGTCGA GGCGCTGCGG AAGTCGCTCG AAGAGGTCGG CACGCTGAAG 30840 

AAGCGGAACC GGCAGCTCGC CGACGCGGCC GGCGAGCCGA TCGCCATCGT CGGCATGGCC 30900 

TCCCGGCTGC CCGGTCGCGT CACCGGGCCC GGTGACCTCT GGCGGCTGGT GGCCGAGGGC 30960 

GGCGACGCCG TCTCGGGGTT CCCCACCGAC CGCTGCTGGG ACCTGGACAC CCTGTTCGAC 31020 

CCGGATCCCG ACCACGCGGG GACGTCGTAC ACCGACCAGG GCGGCTTCCT CCACGACGCG 31080 

GCCCTGTTCG ACCCGGGCTT CTTCGGGATT TCGCCGCGCG AGGCGCTGGC CATGGACCCG 31140 

CAGCAGCGGT TGCTGCTGGA GGCGTCCTGG GAGGCGCTGG AAGGTGTCGG CCTCGACCCG 31200 

GCTTCGTTGC AGGGCACCGA CGTCGGCGTG TTCACCGGCG CGGGCGGGTC GGGCTACGGC 31260 

GGCGGCCTCA CCGGGCCGGA GATGCAGAGT TTCGCGGGCi^ CCGGGCTGGC CTCGAGCGTG 31320 

GCTTCGGGCC GGGTGTCCTA CGTCTTCGGG TTCGAGGGAC CGGCGGTCAC GATCGACACG 31380 

GCGTGCTCGT CGTCGCTGGT GGCGATGCAC CTCGCCGCGC AGGCCCTGCG CCAAGGCGAC 31440 
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TGCTCGATGG CACTGGCCGG CGGCGCGATG GTGATGTCGG GCCCCGACTC CTTCGTCGTC 31500 

TTCTCCCGGC AGCGGGGGCT GGCCACCGAC GGGCGGTGCA AGGCGTTCGC GTCGGGCGCC 31560 

GACGGCATGG TGCTCGCCGA GGGCATCAGC GTGGTCGTGC TGGAGCGGCT TTCGGTCGCG 31620 

CGGGAACGCG GGCACCGGGT GCTGGCCGTG CTGCGCGGCA GCGCGGTGAA CCAGGATGGC 31680 

GCGTCGAACG GCCTGACCGC CCCGAACGGC CCTTCCCAGC AGCGCGTGAT CCGCGCCGCG 31740 

CTGGCCAACG CCGGAATCGG ACCGTCCGAT GTGGACCTCG TCGAGGCGCA CGGGACCGGG 31800 

ACGAGCCTGG GTGATCCCAT CGAGGCGCAG GCCTTGCTGG CGACCTACGG CCAGGACCGG 31860 

GAGACGCCGt TGTGGCTCGG CTCGCTGAAG TCG2^ACATCG GGCACACGCA GGCGGCCGCG 31920 

GGCGTGGCGA GCGTaJ^TCAA GGTCGTGCAG GCGCTGCGGC ACGGCGTCAT GCCGCCGACC 31980 

CTGCACGTCG ACGAGCCCAG CTCGCAGGTC GACTGGTCCG AAGGCGCGGT GGAACTGCTG 32040 

ACCGGGAGCC GGGftCTGGCC GCGCGGGGAC CGGCCGCGCC GGGCCGGGGT GTCGTCGTTC 32100 

GGCGTCAGCG GGACGAACGT GCACCTGATC ATCGAGGAAG CCCCCGAGGA GCCCGCTGCG 32160 

GCCGTGCCGA CGTCCGCGGA CGTCGTGCCG CTGGTGGTTT CCGCACGCAG CACGGGTTCC 32220 

CTGGCCGGTC AGGCCGACCG GCTGfiCCGAG GTGGACGTCC CCCTCGGACA CCTCGCCGGG 32280 

GCGCTGGTGG CCGGGCGCGC GGTGCTCGAG GAACX3CGCGG TCGTGGTCGC CGGTTCGGCC 32340 

GAAGAAGCCC GCGCGGGGCT GGGTGCGCTG GCTCGCGGTG AAGCCGCGCC CGGCX3TCGTG 32400 

ACCGGGACCG CGGGCAftGCC GGGCAAGGTC GTCTGGGTGT TCCCGGGACA GGGGACGCAG 32460 

TGGGTGGGCA TGGGCCGGGA GCTCCTCGAC GCGTCCCCGG TGTTCGCCai^ GCGGATCAAG 32520 

GAGTCCGCGG CGGCACTGGA CCAGTGGACC GACTGGTCGC TGCTGGACGT CCTGCGTGGT 32580 
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GACGGTGACC TGGATTCTGT CGAGGTGCTG CAGCCCGCGT GCTTCGCGGT GATGGTGGGG 32640 
CTGGCCGCGG TCTGGGAGTC GGCGGGGGTC CGGCCGGACG CCGTCGTCGG CCACTCGCAG . 32700 

GGCGAGATCG CCGCGGCCTG CGTGTCCGGC GCGCTCACCC TCGACGACGC CGCGAAGGTG 32760 

GTGGCCCTGC GCAGCCAGGC GATCGCGGCG CGGCTGTCCG GCCGCGGCGG GATGGCGTCG 32820 

GTCGCGTTGA GCGAGGACGA GGCGAACGCA CGGCTGGGTT TGTGGGACGG CCGGATCGAG 32880 

GTGGCCGCGG TCAACGGCCC CGCCTCCGTG GTGATCGCGG GQGACGCCCA AGCCCTCGAC 32940 

?i GAGGCTTTGG AGGTGCTGGC CGGGGACGGC GTCCGCGTCC GGCAGGTCGC GGTCGACTAC 33000 

GCCTCCCACA CCCGGCACGT CGAGGACATC CGCGACACCC TCGCCGAGAC GCTGGCCGGG 33060 

Si 

m 

^^""^ ATCACCGCGC AGGCCCCGGA CGTGCCGTTC CGCTCCACCG TCACCGGCGG CTGGGTGCGG 33120 

3 

H GACGCCGACG TCCTGGACGG CGGGTACTGG TACCGCAACC TGCGCAACCA GGTCCGGTTC 33180 

o. ■ 

□ GGCCCGGCCG TGGCCGAGCT GCTCGAGCAG GGCCACGGGG TGTTCGTCGA GGTCAGCGCC 33240 

CACCCCGTCC TGGTCCAGCC GATCAGCGAG CTCACCGACG CGGTCGTCAC CGGGACGCTG 33300 

CGGCGCGACG ACGGCGGCCT GCGCCGCCTG CTGACGTCGA TGGCCGAGCT GTTCGTCCGC 33360 

GGTGTTCGCG TCa^^CTGGGC CACGCTGGTG CCGCCCGCGC GCGTGGACCT CCCGACGTAC 33420 

GCCTTCGACC ACCAGCACTT CTGGCTCCGG CCGGCCGCGC AGGCGGACGC CGTCTCGCTC 33480 

GGCCAGGCCG CGGCGGAGCA CCCGCTGCTC GGCGCGGTCG TCCGGCTGCC GCAGTCGGAC 33540 
GGCCTGGTCT TCACCTCGCG GCTGTCGCTG CGGACGCACC CGTGGCTGGC CGACCACACC 33600 
ATCGGCGGCG TGGTGCTGTT CCCCGGCACC GGGCTGGTCG AACTGGCCGT GCGGGCCGGC 33660 
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GACGAGGCCG GGTGCCCGGT CCTGGACGAA CTCGTGACCG AGGCGCCGCT GGTCGTGCCC 33720 

GGGCAGGGCG GAGTCAACGT CCAGGTCACG GTGAGCGGCC CGGACCAGAA CGGCTTGCGC 33780 

ACGGTGGACA TCCACTCCCA GCGCGACGAC GTGTGGACCC GGCACGCGAC CGGAACGGTC 33840 

TCGGCGACCC CGGCGAGCAG CCCCGGCTTC GACTTCACCG CGTGGCCGCC GCCGGACGGG 33900 

Ci^GCGCGTCG AGATCGGCGA CTTCTACGCC GACCTCGCCG AGCGCGGGTA CGCGTACGGG 33960 

CCCTTGTTCC AGGGCGTGCG GGCGGTGTCG CAGCGCGGCG AAGACGTGTT CGCCGAGGTC 34020 

GCGCTGCCCG AAGACCGGCG GGAGGACGCC GCCCGGTTCG GCCTGCACCC GGCGTTGCTG 34080 

GACGCGGCCC TCCAGACCGG GACGATCGCC GCGGCCGCGT CCGGTCAGCC GGGCAAGTCC 34140 

GTGATGCCGT TCTCGTGGAA CCGGCTGGCG CTGCACGCCG TCGGGGCCGC GGGCCTCCGG 34200 

GTCCGCGTGG CCCCCGGCGG ACCGGACGCG CTGACCGTCG AGGCCGCCGA CGAGACCGGC 34260 

GCCCCGGTCC TCACCATGGA CTCGCTGATC CTGCGTGAAG TCGCCCTCGA CCAGCTGGAC 34320 

ACTGCGCGCG CCGGCTCGCT CTACCGGGTG GACTGGACGC CACTGCCCAC TGTGGACAGT 34380 

GCGGTGCCCG CTGGTCGGGC CGAGGTGCTG GAAGCTTTCG GCGAGGAGCC CCTGGACCTG 34440 

ACCGGCCGGG TGCTGGCCGC CCTGCAGGCG TGGCTTTCCG ACGCGGCGGA GGAAGCCCGC 34500 

CTQGTCGTGG TGACCCGGGG TGCGGTGCCC GCCGGAGACG GTGTGGTGAG CGATCCGGCG 34560 

GGTGCCGCGG TGTGGGGCCT GGTCCGGGCC GCGCAGGCGG AGAACCCGGA CCGGTTCGTC 34620 

CTCCTCGACA CCGACGGCGA GGTCCCGCTG GAAGCGGTGC TGGCGACCGG TGAGCCX3CAG 34680 

CTCGCGCTGC GCGGCACGAC GTTCTCGGTG CCCCGGCTCG CCCGCGTCAC CGAACCGGCG 34740 

GAAGCCCCGC TGACGTTCCG TCCGGACGGG ACGGTCCTGG TCTCCGGCGC CGGGACGCTG 34800 
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GGTGCGCTCG CCGCCCGCGA CCTCGTCACC CGGCACGGCG TCCGGCGGCT CGTGCTGGCC 34860 

AGCCGGCGCG GCCGGGCCGC CGAGGGCATC GACGACCTCG TCGCCGAGCT GACCGGGCAC 34920 

GGCGCCGAAG TCACGGTCGC CGCCTGCGAC GTCTCCGACC GCGACCAGGT GGCGGCGCTG 34980 

CTCAAGGAAC ACGCGCTCAC CGCGGTGGTG CACACGGCGG GCGTGTTCGA CGCCGGTGTC 35040 

ACCGGCGCGC TGACCCGGGA GCGGCTGGCC AAGGTGTTCG CGCCCAAGGT CGACGCGGCC 35100 

1,=, AftCCACCTCG ACGAGCTGAC CCGCGACCTG GACCTCGACG CGTTCATCGT CTACTCGTCC 35160 

"P GCCTCCTCGA TCTTCATGGG CGCGGGCAGC GGCGGGTACG CGGCGGCGAA CGCCTACCTC 35220 

2 GACGGCCTGA TGGCCGCCCG GCGCGCGGCG GGCCTGCCGG GGCTGTCGCT GGCCTGGGGC 35280 

Li 5 

m 

= CCGTGGG-AGC AGCTCACCGG CATCGCCGAC ACCATCGACG ACCTCACCCT GGCCCGGATG 35340 

b AGCCGGCGCG AAGGCCGCGG CGGCGTCCGC GCGCTCGGCT CCGCCGACGG CATGGAGCTG 35400 

Li j 

il TICGACGCCG CGCTCGCGGC CGGGCAGGCG CTGCTGGTGC CGATCGAGCT CGACCTGCGC 35460 

GAGGTGCGGG CCGACGCGGC CGGCGGCGGC ACGGTGCCGC ACCTGCTGCG CGGGCTGGTC 35520 

CGCGCGGGCC GGCAGGCGGC GCGGACGGCG GCCACCGAGG ACGGCGGCCT GGAACGCCGG 35580 

CTGGCCGGGC TCACCGTCGC CGAACAGGAA GCGCTGCTGC TCGACCTCGT CCGCGGTCAG 35640 

GTCGCCGTCG TGCTCGGGCA CGCCGACAGC TCCGGCGTCC GCGCCGACGC GGCGTTCAAG 35700 

GACGCCGGGT TCGACTCGCT GACGTCGGTG GAGCTGCGCA ACCGGCTGCG CGAGACGACC 35760 

GGCCTGAAAC TGCCCGCGAC GCTGGTCTTC GACCATCCGA ACCCGCTGGC ACTGGCCCGG 35820 

CACCTXSCGGG CGGAACTCGC CGTCGACGAG GCATCCCCGG CCGATGCGGT GCTGGCCGGG 35880 



Q 
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CTCGCCGGGC TGGAGGCGGC CATCGCGGCC GCCGGCGCCC CGGT^GGCGA CCGGATCACC 35940 

GCGCGGCTGC GGa^^CTGCT CAAGGCCGCC GAGGCGGCCG AGGCCCGGCC GGGCACCTCC 36000 

GGCGATCTCG ACACGGCCAG CGACGAGGAA CTGTTCGCCC TCGTCGACGG GCTCGACTGA 36060 

AACCGCTGTG ACATCCGGGG CTTCGCCACC CGGGCCCCGA AAAGCAAGCA CACGTGAGAG 36120 

TOXZTGGGAGT TGAGTTCAGT GGCTGACGAG GGACAACTCC GCGACTACCT CAAGCGGGCC 36180 

ATCGCCGACG CCCGCGACGC CCGCACGCQG CTGCGCGAQG TCGAGGAGCA GGCGCGGGAG 36240 

CCGATCGCCA TCGTCGCCAT GGCGTGCCGG TACCCGGGCG GGGTGTCCTC GCCCGAGGAC 36300 

CTGTGGCGGC TGGTGGCCGA GGGGACCGAC GCCGTCTCCG CGTTCCCCGG CGACCGCGGC 36360 

¥,1 TCGGACGTCG ACGGGCTCGT CGACCCGGAC CCCGACCGCC CGGGCACGAC GTACACGGAC 36420 

: IT 

P CAGGGTGGCT TCCTCCACGA GGCCGGCCTC TTCGACGCGG GGTTCTTCGG GATCTCGCCG 36480 

y3 CGGGAGGCCG TCGCGATGGA CCCGCAGCAG CGGCTGCTGC TGGAGACGTC CTGGGAGGCC 36540 

Z 3 
"CSC 

HI* 

ATCGAACGCA CCGGCACCGA CCCGCTTTCG CTGAAGGGC\ GCGACATCGG CGTCTTCACC 36600 

GGCGTCGCGA GC^TGGGTTA CGGCGCCGGT GGCGGCGTGG TCGCGCCGGA GCTGGAGGGT 36660 

TTCGTCGGCA CCGGTGCGGC GCCGTGCATC GCGTCCGGCC GGGTGTCGTA CGTCCTCGGC 36720 

TTCGAAGGCC CGGCGGTCAC CGTCGACACC GGGTGTTCGT CGTCGCTGGT GGCGATGCAC 36780 

CTCGCCGCGC AGGCGCTGCG GCGGQGTGAG TGCTCGATGG CTCTGGCCX^G CGGCGCGATG 36840 

GTGATGGCCC AGCCGGGTTC GTTCGTGTCC TTCTCGCGGC AACGCGGGCT CGCCCTGGAC 36900 

GGGCGCTGCA AGGCGTTTTC GGACAGCGCC GACGGGATGG GACTGGCCGA GGGCGTCGGC 36960 

GTCATCGCGC TGGAACGGCT GTCGGTCGCC CGTGAGCGTG GGCACCGGGT GCTGGCCGTG 37020 
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CTGCGCGGTA TCGCGGTGAA CCAGGATGGC GCGTCGAACG GCTTGACCGC CCCGAACGGC 37080 

CCGTCCCAGC AGCGGGTGAT CCGCGCCGCG CTGGCCGAAG CCGGGCTGTC GCCGTCCGAT 37140 

GTGGACGCCG TCGAAGGGCA CGGGACGGGC ACGACGCTGG GCGATCCGAT CGAAGCGCAG 37200 

GCGTTGCTGG CCACCTACGG CAAGGGCCGG GACCCGGAGA AGCCGCTCTG GCTGGGCTCG 37260 

GTGAAGTCGA ACCTCGGGCA CACGCAAGCG GCCGCGGGCG TGGCCAGCGT GATCAAGATG 37320 

GTGCAGGCGC TGCGCCACGG CGTGCTGCCC CCGflCGCTGC ACGTCGACCG GCCGTCCACC 37380 

GAAGTCGACT GGTCGGCCGG TGCGGTCTCG CTGTTGACGG AGGCTCGGGA GTGGCCGCGC 37440 

GAAGGGCGGC CGCGCCGGGC CGGGGTGTCC TCGTTCGGGA TCAGCGGGAC CAACGCGCAC 37500 

CTCATCCTCG AGGAAGCGCC CGAGGAGGAG CCGCCCGTCG CCGAAGCGCC TTCCGCCGGA 37560 

GTGGTGCCCG TGGTGGTCTC GGCTCGTGGG GCCCTGGCGG GTCAGGCCGG CCGGCTGGCC 37620 

GCGTTCCTCG AGGCGTCCGA CGAGCCGTTG GTGACCGTCG CCGGGGCGCT GATCTGCGGC 37680 

CGGTCCCGGT TCGGCGACCG GGCCGTCGTG GTGGCGGGCA CGCGCGCAGA GGCGACGGCC 37740 

GGGCTGGCCG CGCTGGCCCG CGGCGAAAGC GCCGCCGACG TCGTGACCGG GACGGTCGCG 37800 

GCCTCGGGCG TGCCGGGCAA GCTCGTGTGG GTGTTCCCGG GCCAGGGTTC GCAGTGGGTG 37860 

GGCATGGGCC GGGAGCTCCT CGAAGCCTCG CCGGTGTTCG CCGCGCGGAT CGCGGAGTGC 37920 

GCGGCTGCCC TCGAACCGTG GATCGACTGG TCGCTGCTGG ACGTCCTCCG TGGCGAGGGC 37980 

GACCTCGACC GCGTCGACGT GGTGCAGCCC GCGAGTTTCG CGGTGATGGT CGGCCTGGCC 38040 

GCGGTGTGGT CGTCCGTCGG GGTGGTGCCC GACGCGGTGC TCGGGCACTC GCAGGGGGAG 38100 
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ATCGCGGCGG CGTGCGTGTC GGGGGCGTTG TCGCTGCAGG ACGCGGCGAA GGTGGTCGCG 3816C 

TTGCGCAGCC AGGCGATCGC GGCGAAGCTG GCCGGCCGCG GCGGCATGGC CTCGGTCGCG 38220 

CTCAGCGAGG AAGACGCGGT CGCGCGGTTG CGGCACTGGG CGGACCGGGT CGAGGTGGCC 38280 

GCGGTCAACA GCCCGTCGTC GGTGGTGATC GCCGGCGACG CCGAAGCCCT CGAGCAGGCC 38340 

CTCGAAGCAC TGACCGGCCA GGACATCCGG GTCCGGCGGG TGGCGGTGGA CTACGCCTCG 38400 

CACACCCGGC ACGTCGAAGA CATCCAGGAG CCCCTCGCCG AGGCACTGGC CGGGATCGAG 38460 

C5 ■ GCGCACGCGC CGACCCTGCC GTTCTTCTCG ACCCTCACCG GTGACTGGAT TCGCGAAGCG 38520 

sfs: 
H I 

m GGCGTCGTCG ACGGCGGCTA CTGGTACCGG AACCTGCGCA ACCAGGTCGG TTTCGGCCCG 38580 

fsa * 

GCGGTGGCCG AGCTGCTCGG CCTCGGCCAC CGGGTGTTCG TCGAGGTCAG CGCGCACCCC 38640 

GTGCTCGTCC AGGCGATCAG CGCGATTGCC GACGACACCG ACGCGGTCGT CACCGGCTCG 38700 

o: 

C3 GTCCGGCGGG AGGAGGGCGG CCTCCGGCGG COXSCTCACGT CGATGGCCGA GCTGTTCGTC 38760 

CGCGGAGTCG ACGTGGACTC GGCCACGATC GTCCCGCCAG CGCGGGTCGA TTTCCCGACC 38820 

TACGCCTTCG ACCACCAGCA CTACTCGCTC CGGTACGTCG AGACCGCGAC CGACGCGGCC 38880 

GGTCCGGTCG TCCGGCTCCC GCAGACGGGC GGCCTCGTCT TCACCACCGA GTCGTCGCTC 38940 

AAGTCACAGC CGTCGCTCGC CGAGCACACC CTGGAAGACC TGGTCGTCGT CCCCGGCGCG 39000 

GCACTCGTCG AGCTCGCCGT CCGGGCCGGT GACGAGGCCG GGACCCCGGT GCTCGACGAA 39060 

CTCGTCATCG AGACGCCCCT GGTCGTCCCG GAACGCGGCG CGATCCGGGT GCAGGTCACG 39120 

GTGAGCGGAC CGGAGGACGG CACACGGACC CTCGAAGTGC ATTCCCAGCC CGAAGACGCC 39180 

ACCGACGAAT GGACCCGGCA CGCCACCGGC ACGCTCTCGG CGACCCCGGA CGAAAGCAGC 39240 



I 
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GGGTTCGACT TCACGGCCTG GCCGCCCCCG GGCGCCCGGC AGCTCGACGG CGTTCCGGCG 39300 

ATCTGGCGGG CCGGCGACGA GATCTTCGCC GAAGTCTCCC TGCCCGACGA TGCGGACGCC 39360 

GAGGCATTCG GCATCCACCC CGCGCTCCTG GACGCGGCCC TGCACCCCGC CCTGCCCGGC 39420 

aiVTGACGGTC TCACGCAGCC CATGGAATGG CGTGGCCTGA CGCTGCACGC CGCGGGGGCG 39480 

TCGACGCTGC GGGTCCGGTT GGTGCCCGGC GGGTTCCTGG AAGCGGCCGA CGGCGCCGGC 39540 

1.4, AGCCTGGTCG TCACGGCGAA GGAGGTTGCC CTCCGCCCGG TGACGATCGC GCGGTCGCGC 39600 

b 

% ACCACCACCC GAGACTCGCT GTTCCAGCTG AACTGGATCG AGCTGCCCGA GAGTGGCGTG 39660 

m 

GTXSGCCGCGG CAGACGACAC CGAGGTGCTG GAGGTGCCCG CGGGCGATTC CCCGCTGGCG 39720 

01 

yi ■ 

s GCGACCTCCC GAGTCTTGGA GCGGCTCCAG ACCTGGCTGA CCGAGCCCGA GGCGGAACAG 39780 

Q 

Q CTGGTCGTCG TCACGCGCGG CGCGGTGCCC GCCGGGGACA CCCCGGTGAC CGACCCGGCC 39840 

□■ 

GCGGCGGCGG TCTGGGGCCT GGTCCGGTCC GCGOVGGCGG AGAACCCGGA CCGGATCGTC 39900 

CTGCTCGACA CCGACGGCGA AGTCCCGCTG GGTGCGGTGC TGGCCGGCGG CGAGCCGCAG 39960 

GTCGCGGTGC GCGGCACGGC GCTGTACGTC CCGCGCCTGG CCCGCGCCGA CGCGGCCCCG 40020 

GTATCCGGTC TACATGGGAC GGTCCTCGTC TCCGGTGCCG GTGTGCTCGG CGAGATCGTG 40080 

GCGCGGCACC TGGTCACCCG CCACGGCGTG CGCAAGCTGG TGCTCGCCAG CCGCCGCGGC 40140 

CTGGACGCCG ACGGCGCGAA GGACCTCGTC ACCGACCTCA CCGGCGAGGG CGCGGACGTG 40200 

TCCGTCGTCG CCTGCGACCT GGCCGATCGG AACCAGGTGG CCGCGCTGCT GGCCX3ACCAC 40260 

CGCCCGGCGA GCGTCATCCA CACGGCGGGC GTCCTCGACG ACGGCGTCAT CGGGACGCTG 40320 
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ACCCCGGAGC GGCTGGCCAA GGTGTTCGCG CCCAAGGTCG ACGCGGTCCG CCATCTCGAC 40380 

GAGCTCACTC GCGACCTCGA CCTCGACGCG TTCGTCGTGT TCTCCTCCGG CTCCGGCGTG 40440 

TTCGGTK:GC CGGGGCAGGG CAACTACGCG GCGGCGAACG CGTTCCTGGA CGCGGCGATG 40500 

GCGAGCCGCC GCGCGGCGGG TCTTCCTGGT CTCTCGCTGG CGTGGGGCCT GTGGGAACAG 40560 

GCCACCGGCA TGACCGCGCA CCTCGGCGGC ACCGACCAGG CCCGGATGAG CCGGGGCGGG 40620 

G-rcCGGCCGA TCACGGCCGA GGAAGGCATG GCCCTGTTCG ACACGGCACT GGGTGCGCAG 40680 

CCCGCGCTGC TCGTGCCGGT CAAGCTCGAC CTGCGGGAGG TGCGGGCCGG CGGGGCCGTG 40740 

CCGCACCTCC TCCGCGGGCT GGTCCGGGCC GGGCGGCGGC AGGCCCAAGC CGCGTCCACA 40800 

GTGGACAACC AGCTGCTCGG CCGGCTGGCC GGGCTGGGCG CGCCCGAGCA GGAGGCGCTG 40860 

CTCGTCGACC TCGTCCGCGG CCAGGTCGCG GCGGTGCTCG GGCACGCCGG GCCGGACGCG 40920 

GTCCGCGCCG ACACGGCGTT CAAGGACGCC GGGTTCGACT CGCTCACCTC GGTCGACCTG 40980 

CGCAACCGGC TGCGGGAGAG CACCGGGCTG AAGCTGCCCG CCACGCTCGC CTTCGftCTAC 41040 

CCGACCCCGC TGGTCCTCGC CCGGCACCTG CGTGACGAGC TCGGGGCCQG CX3ACGACGCG 41100 

CTTTCGGTGG TGCACGCGCG GCTCGAAGAC GTCGAGGCGC TGCTCGGCGG GCTGCGCCTC 41160 

GACGAATCCA CGAAGACCGG TCTCACCCTC CGGCTGCAGG GCCTGGTCGC CCGGTGCAAC 41220 

GGCGTGAACG ACCAGACCGG CGGCGAAACG CTGGCGGACC QGCTCGAGGC CGCGTCCGCC 41280 

GACGAAGTCC TCGACTTCAT CGACGAGGAG CTGGGTCTCA CCTGACCCCG GTTCGAGACC 41340 
GACGTTCCAG CAACCCTTCT GAGGACCCGA GAATGGCCAC GGACGAGAAA CTCCTCAAAT 41400 
ACCTCAAGCG CGTCACGGCG GAGCTGCACA GCCTGCGCAA GCAGGGTGCC CGGCACGCCG 41460 
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ACGAGCCGCT CGCCGTCGTC GGGATGGCCT GCCGGTTCCC GGGTGGGGTG TCCTCGCCCG 41520 

AAGACCTGTC GCAGCTCGTC GCCGGCGGGG TCGACGCCCT TTCGGACTTC CCCGACGACC 41580 

GGGGCTGGGA GCTGGACGGC CTGTTCGACC CGGACCCCGA CCACCCCGGG ACGTCGTACA 41640 

CCAGCCAGGG CGGCTTCCTG CGTGGCGCCG GGCTGTTCGA CGCGGGCCTG TTCGGCATCT 41700 

CGCCGCGCGA GGCCCTCGTC ATGGACCCGC AGCAGCGGGT GCTGCTGGAG ACGTCGTGGG 41760 

AGGCCCTCGA AGACGCCGGG GTCGACCCGC TTTCGCTGAA GGGCAGCGAC GTCGGCGTGT 41820 

TCTCCGGCGT CTTCACCCAG GGCTJ^GGCG CCGGGGCGAT CACGCCGGJ^ CTCGA^ 41880 

TCGCGGGCAT CGGGGCGGCG TCGAGCGTGG CGTCGGGCCG GGTGTCCTAC GTCTTCGGGC 41940 

TCGAAGGACC GGCGGTCACC ATCGACACCG CGTGTTCGTC GTCGCTGGTG GCCATCCACC 42000 

TCGCCGCGCA GGCCCTGCGC GCGGGCGAGT GCTCGATGGC GCTCGCCGGC GGGGCGACGG 42060 

TGATGCCGAC GCCCGGCACC TTCGTCGCGT TCTCGCGGCA GCGGGTGCTG GCTGCCX3ACG 42120 

GCCGGTCCAA GGCCTTCTCC TCGACCGCGG ACGGCACCGG CTGGGCCGAG GGCGCCGGGG 42180 

TGCTCGTCCT CGAACGGCTT TCGGTCGCGC AGGAGCGCGG CCACCGGATT CTCGCCGTGC 42240 

TGCGCGGCAG CGCGGTCAAC CAGGATGGCG CCTCCAACGG CCTGACCGCG CCGAACGGGC 42300 

CTTCGCAGCA GCGGGTGATC CGCAAGGCGC TCGCGGGCGC CGGGCTGGTC GCGTCCGATG 42360 

TGGACGTCGT GGAGGCGCAC GGCACGGGCA CCGCGCTGGG CGACCCGATC GAAGCGCAGG 42420 

CGCTGCTGGC GACCTACGGC CAGGGCCGTG AGCGGCCGCT GTGGCTGGGG TCGGTCAAGT 42480 

CGAACTTCGG GCACACGCAG GCGGCCGCCG GGGTCGCGGG CCTGATCAAG ATGGTCCAGG 42540 
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CCCTGCGGCA CGGCGCCATG CCGCCGACCC TGCACGTGGC CGAGCCGACG CCGGAGGTCG 42600 

ACTCGTCGGC CGGTGCGGTG GAACTGCTGA CCGAGCCGCG CGAGTGGCCC GCCGGTGATC 42660 

GGCCGCGCCG GGCCGGGGTG TCCGCGTTCG GGATCAGCGG GACGAACGCC CACCTGATCC 42720 

TGGA.GGAGGC GCCCCCGGCC GACGCGGTCG CGGAAGAACC GGAGTTCAAG GGGCCGGTGC 42780 

CGCTGGTCGT CTCGGCGGGC AGCCCCACAT CTTTGGCGGC TCAGGCCGGC CGGCTCGCGG 42840 

AGGTCCTCGC GTCCGGTGGT GTGTCCCGGG CCCGGCTGGC GAGCGGGCTG CTGTCGGGCC 42900 

GGGCGCTGCT CGGTGACCGC GCGGTCGTGG TCGCGGGAAC GGACGAGGAC GCGGTGGCCG 42960 

GGTTGCGTGC GCTGGCCCGC GGGGACCGCG CGCCCGGCGT GCTGACCGGT TCGGCCAAGC 43020 

ACGGCAAGGT CGTCTACGTC TTCCCCGGCC AGGGTTCGCA GCGGCTCGGG ATGGGCCGCG 43080 

AGCTCTACGA CCGGTACCCG GTGTTCGCGA CGGCGTTCGA CGAGGCTTGC GAGCAGCTGG 43140 

ArGTCTGTCT GGCCGGCCGT GCCGGGCACC GCGTGCGGGA CGTCGTGCTC GGCGAAGTGC 43200 

CCGCCGAAAC CGGGCTGCTG AACCAGACGG TCTTCACCCA AGCCGGGCTG TTCGCGGTGG 43260 

AGAGCGCGCT GTTCCGGCTC GCCGAATCCT GGGGTGTCCG GCCGGACGTG GTGCTCGGCC 43320 

ACTCCATCGG GGAGATCACC GCCGCGTATG CCGCGGGCGT CTTCTCGCTG CCGGACGCCG 43380 

CCCGGATCGT CGCGGCGCGC" GGCCGGCTGA TGCAGGCGCT GGCGCCGGGC GGGGCGATGG 43440 

TCGCCGTCGC CGCCTCCGAA GCCGAGGTGG CCGAACTGCT CGGCGRCGGC GTGGAACTCG 43500 

CCGCCGTCAA CGGCCCTTCG GCGGTAGTCC TTTCCGGGGA CGCGGACGCG GTCGTCGCGG 43560 

CCGCCGCCCG CATGCGCGAG CGCGGGCACA AGACCAAGCA GCTCAAGGTT TCGCACGCGT 43620 

TCCACTCCGC GCGGATGGCG CCGATGCTGG CGGAGTTCGC CGCCGAGCTG GCCGGCGTGA 43680 
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CGTGGCGCGA GCCC3GAGATC CCGGTGGTCT CCAACGTGAC CGGCCGGTTC GCCGAGCCCG 43740 

GCGAACTGAC CGAGCCGGGC TACTGGGCCG AGCACGTGCG GCGGCCGGTG CGGTTCGCCG 43800 

AGGGCGTCGC GGCCGCGACG GAGTCCGGCG GCTCGCTGTT CGTGGAGCTC GGGCCGGGGG 43860 

CGGCGCTCAC CGCCCTCGTC GAGGAGACGG CCGAGGTCAC CTGCGTCGCG GCCCTGCGGG 43920 

ACGACCGCCC GGAGGTCACC GCGCTGATCA CCGCGGTCGC CGAGCTGTTC GTCCGCGGGG 43980 

TTGCGGTCGA TTGGCCGGCC CTGCTGCCGC CX3GTCACCGG GTTCGTCGAC CTGCCGAAGT 44040 

ACGCCTTCGA CCAGCAGCAC TATTGGCTGC AGCCCGCTGC GCAGGCCACG GftCGCGGCCT 44100 

CGCTCGGGCA GGTCGCGGCC GACCACCCGC TGCTGGGCGC GGTGGTCCGG CTGCCGCAGT 44160 

CGGACGGCCT GGTCTTCACC TCGCGGCTGT CATTGAAATC GCACCCGTGG CTGGCX:GACC 44220 

ACGTCATCGG CGGGGTGGTG CTCGTCGCGG GCACCGGGCT CGTCGAGCTG GCCGTCCGGG 44280 

CCGGGGACGA GGCCGGCTCC CCGGTCCTCG AAGAACTCGT CATCGAGGCT CCGCTGGTCG 44340 

TCCCCGACCA CGGCGGGGTC CGGATCO^ TCGTCGTGGG GGCACCGGGG GAGACCGGT'T 44400 

CGCGCGCGGT CGAGGTGTAC TCCCTGCGCG AGGACGCCGG TQCCGAAGTG TGGGCCCGGC 44460 

ACGCCACCGG GTTCCTGGCT GCGACGCCGT CGCAGCACAA GCCGTTCGAC TTCACCGCCT 44520 

GGCCGCCGCC CGGCGTCGAG CGCGTCGACG TCGAGGACTT CTACGACXMC CTCGTCGACC 44580 

GCGGGTACGC CTACGGGCCG TCGTTCCGGG GCCTGCGGGC GGTGTGGCGG CGCGGCGACG 44640 

AAGTGTTCGC CGAGGTCGCC CTGGCCGAGG ACGACCGCGC GGACGCGGCC CGGTTCGGCA 44700 

TCCACCCCGG CCTGCTGGAC GCCGCCCTGC ACGCGGGCAT GGCCGGTGCC ACCACCACGG 44760 
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AAGAGCCCGG CCGGCCGGTG CTGCCGTTCG CCTGGAACGG CCTGGTGCTG CACGCGGCCG 
GGGCGTCCGC GCTGCGGGTC CGGCTCGCCC CGAGCGGTCC GGACGCCCTG TCGGTCGAGG 
CCGCGGACGA GGCCGGCGGT CTCGTTGTGA CGGCGGACTC GCTGGTCTCC CGGCCGGTGT 
CGGCCGAACA GCTGGGCGCG GCGGCGAACC ACGACGCGTT GTTCCGCGTG GAGTGGACCG 
AGATTTCCTC GGCTGGAGAC GTTCCGGCGG ACCACGTCGA AGTGCTCGAA GCCGTCGGCG 
AGGATCCCCT GGAACTGACC GGCCGGGTCC TGGAGGCCGT GCAGACCTGG CTCGCCGACG 
CAGCCGACGA CGCTCGCCTG GTCGTGGTGA CCCGCGGCGC CGTCCACGAG GTGACTGACC 
CGGCCGGTGC CGCGGTGTGG GGCCTGATCC GGGCCGCGCA GGCGGAAAAC CCGGACCGGA 
TCGTGCTGCT GGACACCGAC GGTGAAGTGC CGCTAGGCCG GGTGCTGGCC ACCGGCGAGC 
CCCW-CAGC CGTCCGAGGC GCCACGCTGT TCGCCCCGCG GCTGGCCCGC GCCGAGGCCG 
CGGAGGCACC GGCAGTGACC GGCGGGACGG TCCTGATCTC GGGCGCCGGC TCGCTGGGCG 
CGCTCACCGC CCGGCACCTG GTCGCCCGGC ACGGAGTCCG GCGGCTGGTG CTCGTCAGCC 
GCCGTGGCCC CGACGCCGAC GGCATGGCCG AACTGACCGC TGAACTCATC GCTCAGGGCG 
CCGAGGTCGC CGTAGTCGCT TGCGACCTGG CCGACCGGGA CCAGGTCCGG GTACTGCTGG 
CCGAGCACCG CCCGAACGCC GTCGTGCACA CGGCCGGTGT TCTCGACGAC GGCGTCTTCG 
AGTCGCTCAC GCGGGAGCGG CTGGCCAAGG TCTTCGCGCC CAAAGTTACT GCTGCCAATC 
ACCTCGACGA GCTGACCCGC GAACTGGATC TTCGCGCGTT CGTCGTGTTC TCCTCCGCCT 
CCGGGGTCTT CGGCTCCGCC GGGCAGGGCA ACTACGCCGC TGCCAACGCC TACCTGGACG 
CCGTGGTCGC CAACCGCCGG GCCGCGGGCC TGCCCGGCAC ATCGCTGGCC TGGGGCCTGT 
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GGGAACAGAC CGACGGGATG ACCGCGCACC TCGGCGACGC CGACCAGGCG CGGGCGAGTC 45960 

GCGGCGGGGT CCTCGCCATC TCACCCGCCG AAGGCATGGA GCTCTTCGAC GCAGCGCCGG 46020 

ACGGGCTCGT CGTCCCGGTC AAGCTGGACC TGCGCAAGAC CCGCGCCGGC GGGACGGTGC 46080 

CGCACCTGCT GCGCGGCCTG GTCCGCCCGG GACGGCAGCA GGCCCGTCCG GCGTCCACTG 46140 

TGGACAACGG ACTGGCCGGG CGACTCGCCG GGCTCGCGCC GGCGGAGCAG GAGGCGCTGC 46200 

TGCTCGACGT CGTCCGCACG CAGGTCGCGC TGGTGCTCGG GCACGCCGGG CCGGAGGCCG 46260 

TCCGCGCGGA CACGGCGTTC AAGGACACCG GCTTCGACTC GCTGACGTCG GTGGAACTGC 46320 

f i I 

Si 

2°^. GCAACCGGCT GCGCGAGGCG AGCGGGCTGA AGCTGCCCGC GACGCTCGTC TTCGRCTACC 46380 

P CGACGCCGGT CGCGCTGGCC CGCTACCTGC GTGACGAACT CGGCGACACG GTGGCAACAA 46440 

hi' 

CTCCGGTGGC CACCGCGGCC GCAGCGGACG CCGGCGAGCC GATCGCCATC GTCGGCATGG 46500 



CGTGCCGGCT GCCGGGCGGG GTCACCGATC CCGAAGGCCT GTGGCGCCTG GTGCGCGACG 46560 

GCCTCGAAGG GCTGTCTCCC TTCCCCGAGG ACCGGGGCTG GGACCTGGAG AACCTGTTCG 46620 

ACGACGACCC CGACCGCTCC GGCACGACGT ACACCAGCCG GGGCGGGTTC CTCGACGGCG 46680 

CCGGCCTGTT CGACGCGGGC TTCTTCGGGA TTTCGCCGCG CGAGGCGCTG GCCATGGACC 46740 

CGCAGCAGCG GCTGCTGCTC GAGGCGGCCT GGGAAGCCCT CGAAGGCACC GGTGTCGACC 46800 

CGGGCTCGTT GAAGGGCGCC GACGTCGGGG TGTTCGCCGG GGTGTCCAAC OVGGGCTATG 46860 

Ga^VTGGGCGC GGATCCGGCC GAACTGGCGG GGTACGCGAG CACGGCGGGC GCTTCGAGCG 46920 

TCGTCTCGGG CCGAGTCTCG TACGTCTTCG GGTTCGAAGG ACCGGCGGTC ACGATCGACA 46980 
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CGGCTTGCTC GTCGTCGCTG GTGGCGATGC ACCTGGCCGG GCAGGCGCTG CGGCAGGGCG 47040 

AGTGCTCGAT GGCCCTGGCC GGTGGCGTCA CGGTGATGGG GACGCCCGGC ACGTTCGTGG 47100 

AGTTCGCGAA GCAGCGCGGC CTGGCCGGCG ACGGCCGGTG CAAGGCCTAC GCCGAAGGCG 47160 

CGGACGGCAC GGGCTGGGCC GAGGGCGTCG GGGTCGTCGT GCTGGAGCGG CTGTCGGTGG 47220 

CGCGCGAGCG CGGGCACCGG GTGCTGGCCG TGCTGCGCGG CAGCGCGGTC AACTCCGACG 47280 

GCGCGTCCAA.CGGCCTGACC GCCCCCAACG GGCCGTCGCA GCAACGGGTG ATCCGCCGGG 47340 

a 

C3 CCCTGGCCGG CGCCGGCCTC GAACCGTCCG ATGTGGACAT CGTGGAAGGG CACGGCACCG 47400 

$ 

01 . GGACGGCGCT GGGCGACCCG ATCGAGGCGC AGGCCCTGCT GGCCACCTAC GGCAAGGACC 47460 

ii ■ 

5^"" GCGACCCGGA GACGCCGTTG TGGCTGGGGT CGGTGAAGTC GAACTTCGGC CACACGCAGT 47520 

^' CCGCGGCCGG CGTCGCCGGG GTGATCAAGA TGGTGCAGGC GCTGCGCCAC GGCGTCATGC 47580 

P. 

H CGCCCACCCT GCACGTGGAC CGGCCCACCA GCCAGGTCGA CTGGTCCGCG GGGGCCGTCG 47640 

Pi 

AAGTGCTGAC CGAGGCACGG GAGTGGCCGC GGAACGGCCG TCCGCGCCGG GCCGGGGTGT 47700 

CCTCGTTCGG GATCAGCGGC ACGAACGCCC ACCTGATCAT CGAAGAAGCA CCGGCCGAGC 47760 

CACAGCTTGC CGGACCACCG CCGGACGGCG GTGTGGTGCC GCTGGTCGTC TCGGCTCGC^ 47820 

GCCCCGGTGC CCTGGCCGGT CAGGCGCGTC GGCTGGCCAC GTTCCTCGGC GACGGGCCCC 47880 

TTTCCGACGT CGCCGGTGCG CTGACGAGCC GCGCCCTGTT CGGCGAGCGC GCGGTCGTCG 47940 

TGGCGGATTC GGCCGAGGAA GCCCGCGCCG GTCTGGGCGC ACTGGCCCGC GGCGAAGACG 48000 

CGCCGGGCCT GGTCCGCGGC CGGGTGCCCG CGTCCGGCCT GCCGGGCAAG CTCGTGTGGG 48060 

TGTTCCCCGG GCAGGGGACG CAGTGGGTGG GCATGGGCCG CGAACTCCTC GAAGAGTCTC 48120 
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CGGTGTTCGC CGAGCGGATC GCCGAGTGTG CGGCCGCGCT GGAGCCGTGG ATCGGCTGGT 
CGCTGTTCGA CGTCCTCCGT GGCGACGGTG ACCTCGATCG GGTCGATGTG CTGCAGCCCG 
CGTGCTTTCC GGTGATGGTC GGCTTGGCCG CGGTGTGGTC CTCGGCCGGG GTGGTCCCCG 
ATGCGGTGCT CGGCCACTCC CAGGGTGAGA TCGCCGCGGC GTGCGTGTCG GGTGCGTTGT 
CGCTGGAGGA TGCGGCGAAG GTGGTTGCCC TGCGCAGCCA GGCCATCGCC GCGAAGCTCT 
CCGGCCGCGG CGGGATGGCT TCGGTCGCCT TGGGCGAAGC CGATGTGGTG TCGCGGCTGG 
CGGACGGGGT CGAGGTGGCT GCCGTCAACG GTCCGGCGTC CGTGGTGATC GCGGGGGATG 
CCCAGGCCCT CGACGAAACG CTGGAAGCGC TGTCCGGOXSC GGGAATCCGG GCTCGGCGGG 
TGGCGGTGGA CTACGCCTCG CACACCCGGC ACGTCGAAGA CATCGAAGAC ACCCTCGCCG 
AAGCGCTGGC CGGGATCGAC GCCCGGGCGC CGCTGGTGCC GTTCCTCTCC ACCCTCACCG 
GCGAGTGGAT CCGGGACGAG GGCGTCGTGG ACGGCGGCTA CTGGTACCGG AACCTGCGCG 
GCCGGGTGCG GTTCGGCCCG GCCGTCGAGG CGCTGCTGGC CCAGGGGCAC GGTGTGTTCG 
TCGAGCTCiyG CGCCCACCCG GTGCTGGTCC AGCCGATCAC CGJ^SCTCACC GACGAAACCG 
CCGCCGTCGT CACCGGTTCG CTGCGCCGGG ACGACGGTGG CCTGCGCCGG CTGCTGACCT 
CGATGGCCGA GCTCTTCGTC CGTGGGGTCG AAGTGGACTG GACGTCGCTG GTGCCGCCGG 
cccGGGCcGA cx:tcccgacg TACGCCTTCG ACCACGAGCA CTACTGGCTC CGCGCCGCGG 

acaccgcttc cgacgccgtc tcgctggggc tggccggggc ggaccacccg ctgctcggcg 
cggtcgtgca gcttccgcag tccgacggcc tggtcttcac ttcccggctc tccctgcgct 



-98- 

CGCACCCCTG GCTGGCCGAC CACGCGGTCC GGGACGTCGT GATCGTCCCC GGCACCGGGC 49260 

TCGTCGAGCT GGCCGTGCGG GCCGGTCACG AAGCCGGCTG CCCGGTGCTC GACGAGCTGG 49320 

TGATCGAGGC GCCGCTCGTG GTGCCCCGCC GCGGCGGGGT CCGCGTGCAG GTCGCCCTCG 49380 

GCQGCCCCGC CGACGACGGT TCGCGCACGG TGGACGTCTT CTCCCTGCGC GAAGACGCGG 49440 

ACAGCTCGCT CCGGCACGCC ACGGGCGTGC TGGTCCCGGA GAACCGGCCG CGGGGGACCG 49500 

CCGCGTTCGA CTTCGCCGCC TGGCCGCCAC CGGAGGCGAA GCCCGTGGAC CTCACCGGTG 49560 

CCTACGACGT GCTCGCGGAC GTCGGGTACG GCTACGGGCC CACGTTCCGG GCCGTGCGGG 49620 

CCGTGTGGCG GCGCGGCAGC GGGAACACCA CCGAGACCTT CGCCGAGATC GCCCTGCCCG 49680 

AAGACGCCCG CGCGGAAGCC GGCCGGTTCG GCATCCACCC CGCGCTGCTG GACGCGGCCC 49740 

TCCACTCGAC GATGGTCAGC GCCGCGGCGG ACACCGAGTC CTACGGCGAC GAAGTCCGGC 49800 

TGCCGTTCGC GTCGAACGGG CTGCGGCTGC ACGCGGCCGG CGCCTCGGTG CTGCGGGTGC 49860 

GCGTCGCCAA GCCCGAGCGG GACAGTCTCT CGCTGGAGGC CGTCGACGAG TCCGGCGGCC 49920 

TGGTCGTGAC GCTGGATTCC CTC-GTCGGGC GCCCGGTGTC GAACGACCAG CTGACGACGG 49980 

CGGCGGGGCC GGCGGGCGCC GGCTCGCTGT ACCGCGTGGA CTGGACGCCA TTGTCCTCAG 50040 

TGGACACTTC GGGACGGGTG CCGTCCTGGC TTCCGGTCGC CACCGCGGAA GAGGTGGCGA 50100 
CGCTGGCCGA CGACGTCCTC ACCGGCGCGA CCGAGGCGCC GGCGGTGGCC GTCATCGAGG 50160 
CCGTCGCCGA CGAGGGTTCC GTGCTGGCGC TO^-CCGTCCG GGTGCTGGAC GTGGTCCAGT 50220 
GCTGGCTCGC CGGCGGCGGG CTGGAGGGGA CGAAGCTCGC GATCGTQftCC CGCGGCGCGG 50280 
TGCCCGCCGG CGACGGCGTG GTGCACGACC CGGCCGCGGC CGCGGTGTGG GGGCTGGTCC 50340 
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GGGCCGCGCA GGCGGAGAAC CCGGACCGGA TCGTCCTCCT CGACGTCGAG CCGGAAGCCG 50400 

ACGTACCGCC GCTGCTGGGT aXTGGTCCTCG CCGACGGCGA GCCGCAGGTC GCGGTGCGCG 50460 

GAACCACGCT GTCCATCCCC CGCCTCGCCC GCGCCGCCCG GCCCGACCCG GCCGCCGGGT 50520 

TCAAGJ^CCG GGGACCGGTG CIGGTCACCG GCGGGACCGG GTCGCTCGGC GGCCTGGTCG 50580 

CCCGGCACCT GGTCGAGCGG CACGGCGTCC GGCAGCTGGT GCTGGCGAGT CGCCGGGGCC 50640 

TGGACGCCGA AGGCGCGAAG GACCTCGTCA CCGACCTCAC CGCACTGGGG GCCGACGTCG 50700 

C3 CGGTCGCCGC TTCCGACGTC GCCGACCGGG ACCAGGTGGC GGCCCTGCTG ACCGAGCACC 50760 

nj 

m GGCCGTCCGC CGTCGTGCAC ACGGCCGGCG TCCCGGACGC CGGGGTGATC GGGACGGTGA 50820 



ri. 



m 



S w 

£3 



CCCCGGACCG GCTGGCCGAG GTGTTCGCGC CCAAGGTCAC CGCGGCCCGG CACCTCGftCG 50880 

AGCTGACCCG CGACCTCGAC CTCGACAGTT TCGTCGTCTA CTCCTCGGTT TCCGCGGTGT 50940 

\ TCATGGGCGC CGGCAGCGGC AGCTACGCCG CGGCGAACGC GTACCTGGAC GGGCTGATGG 51000 

CCCACCGGCG CGCGGCCGGC CTGCCGGGCC AGTCGCTGGC GTGGGGGCTG TGGGACCAGA 51060 

CCACCGGCGG CATGGCGGCC GGGACCGACG AGGCCGGCCG GGCCCGGATG ACCCX3GCGCG 51120 

GCGGCCTCGT CGCGATGAAA CCCGCCGCCG GACTGGACCT CTTCGACGCT GCCATCGGGT 51180 

CCGGCGAGCC GCTGCTGGTG CCCGCCCAGC TCGACCTGCG GGGCCTGCGC GCCGAAGCGG 51240 

CGGGCGGCAC CGAAGTGCCG CACCTGCTGC GCGGCCTGGT CCGCGCCGGA CGCCAGCAGG 51300 

CCCGTGCGGC GTCCACTGTC GAGGAa^ACT GGGCCGGCCG GCTGGCCGGG CTCGAGCCGG 51360 

CCGAGCGGGG CCAGGTCCTC CTGGAACTGG TGCGCGCCCA GGTGGCAGGG GTCCTGGGCT 51420 
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ACCGCGCCGC CCACCAGGTC GACCCGGACC AGGGCCTGTT CGAGATCGGG TTCGACTCGC 51480 

TCACCGCGAT CGAACTCCGC AACCGGCTGC GCGCCAGGAC CGAACGGAAG ATCTCGCCXG 51540 

GTCTCGTCTT CGACCATCCC ACGCCGGCCC TGCTCGCCGC GCACTTGAAC GAGCTGCTCC 51600 

GAAAGAAGGT GTCAACGTCT TCGACGTGGA GftCCTACCTC CAGCGGATCG GCTGCGGCGG 51660 

GSVAACCGGC GTGGACCTCG AAACGCTGGC GAAGCTGCAG AAGAGCCACC TGATGGCGAT 51720 

CCCGTACAGC AGCCTCGCCT ACGAACTCCG GGACGCGGTG AACGTCGTCG ACCTCGACGA 51780 

GGACGACGTC TrCGTCACCA GCATCGCCGA AGGGCAGGGC GGCGCCTGCT ACCACCTGAA 51840 

CCGGCTCTTC CACCGGCTCC TCACCGAACT CGGCTACGAC GTCACGCCGC TGGCCGGCAG 51900 

OiCCGCCGAA GGCCGGGAGA CCTTCGGCAC CGACGTCGAG CACATGTTCA ACCTGGTCAC 51960 

CCTGGACGGC GCCGACTGGC TCGTGGACGT CGGCTACCCC GGCCCCACCT ACGTCGAGCC 52020 

ACTGGCGGTC TCGCCCGCGG TGCAGACCCA GTACGGGAGC CAGTTCCGGT TGGTGGAACA 52080 

GGAAACCGGT TATGCGCTGC AACGCCGGGG TGCGGTCACC CGCTGGAGCG TCGTCTACAC 52140 
GTTCACGACG CAACCGCGTC AGTGGAGTGA CTGGAAGGAA CTGGAGGACA ACTTCCGGGC 52200 
CCTCGTGGGG GACACCACCC GCACCGACAC GCAGGAAACC CTGTGCGGCC GCGCGTTCGC 52260 
GAACGGCCAG GTCTTCCTCC GGCAGCGCCXS CTACCTGACG GTCGAGAACG GCCGCGAGCA 52320 
GGTGCGCACG ATCACCG2«:G ACGACGAGTT CCGGGCGCTG GTGTCCCGCG TGCTGTCCGG 52380 
CGACCACGGC TGAACTGGCG AAAGGCACGA CGATGACGGA AAAAGCGGGC CTGCTGGCGA 52440 
AGTTCGCCGG CCOXirPGCAAA ACCGCCTACG AGCACCACTA CATCCCGTAC CTGCACTTCT 52500 
OCTACGGCGG CGAGTACCTC CACCACGGCA GCGAGCCGGT GTCCCGGATC GCGGACCTGC 52560 



- 101 - 



CGTACGTGAC CGTCCCGGAG CCGCGGAAa\ AGGCGCCGTG AGGACGACGA TCCCGGTCCG 52620 

CCTGGCGGAA CGGTCCTACG ACGTGCTCGT CGGCCCCGGG GTGCGGGCGG CGCTGCCCGA 52680 

GGTCGTCCGG CGGCTCGGCG CGAGACGGGC CGTGGTCGTG TCGGCCCGGC CGGCGGACTG 52740 

GGTGCCCGGC ACCGGCGTCG AGACCCTGCT GCTCCAGGCG CGCGRCGGCG AGCCGACCAA 52800 

GCGGCTGTCC ACAGTGGAGG AACTGTGCGG TGAGTTCGCG CGGTTCGGGC TCACCCGGTC 52860 

l=i CGACGTCGTG GTCTCCTGCG GCGGCGGCAC GACCACGGAC GTCGTCGGGC TCGCGGCCGC 52920 

ji GCTGTACCAC CGGGGGGTCG CCGTGGTCCA CCTGCCCACG TCCCTGCTCG CCCAGGTCGA 52980 

ij 

CGCCAGCGTC GGCGGGAAGA CCGCGGTGAA CCTGCCGGCG GGCAAGAACC TCGTCGGGGC 53040 

m 

l^, GTACTGGCAG CCCAGCGCGG TGCTGTGCGA CACGGACTAC CTGACGACGC TGCCGCGGCG 53100 

s . ■ 

S3 GGAGGTGCTG AACGGCCTCG GCGAGATCGC CCGCTGCCAC TTCATCGGCG CGCCGGACCT 53160 

rJ GCGGGGGCGC TCGCGCCCGG AGCAGATCGC CGCCAGCGTC ACCCTCAAGG CGGGCATCGT 53220 

CGCGCAGGAC GAGCGGGACA CCGGCCCGCG GCACCTGCTC AACTACGGCC ACACGCTGGG 53280 

GCACGCGCTG GAGATCGCGA CCGGCTTCGC CCTGCGCCAC GGCGAGGCGG TGGCGATCGG 53340 

CACGGTCTTC GCGGGCCGGC TGGCCGGCGC GCTCGGCCGC CTCGACCAGT CCGGTGTGGA 53400 

CGAACACCTC GCCGTCGTCC GCCACTACGG CCTGCCCGCC GCGCTGCCCG CGGACGTCGA 53460 

CCCGGCGGTG CTCGTCCGGC AGATGTACCG GGACAAGAAG GCGATCACCG GGCTCGCCTT 53520 

CGTCCTGGCC GGGCCGCGGG GCGCGGAGCT GGTGAGCGAC GTGCCGGCGC CGGTCGTCAC 53580 

CGACGTCCTG GACCGGATGC CCCGCGACAG CCTGGAAAAC CTGGTGGGGA CGACGGAAGC 53640 
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GGCGGCGCCG TCAAC3CGGCA GCCGGACTTC GCGGCCCACG GCCGGGCGGT CGACCGGGTG 53700 
GTCGCCGGCC GGCTGAGCGC GGCGCTGGCC CGGCCGGCCG CGCAGCAGCC GGGCTGGCCG 53760 
GACGCCGAGC GGGCGGCCGA GGTGAATTC 53789 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 4572 aitiino acids 

C3. (B) TYPE: amino acid 

2*; (C) STRANDEDNESS : single 

i*J 

01 (D) TOPOLOGY: linear 

y (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Phe Tyr Thr Ser Gly Thr Thr Gly Arg Pro Lys Gly Val Val Ser 
15 10 15 

Thr Gin Arq Asn Cys Leu Trp Ser Val Ala Ser Cys Tyr Val Pro Phe 
20 25 - 30 



Pro Gly Leu Ser Asp Gin Asp Arg Val Leu Trp Pro Leu Pro Leu Phe 
35 40 45 

His Ser Leu Ser His He Ala Cys Val Leu Ser Ala Thr Val Val Gly 
50 55 60 

Ala Ser Val Arg He Ala Asp Gly Ser Ser Ala Asp Asp Val Met Arg 
65 70 75 80 
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Leu He Glu Ala Glu Ser Ser Thr Phe Leu Ala Gly Val Pro Thr Thr 
85 90 95 

Tyr His His Leu Val Arg Ala Ala Arg Gin Arg Gly Phe Ser Ala Pro 
100 105 110 

Ser Leu Arg He Gly Leu Ala Gly Gly Ala Val Leu Gly Ala Gly Leu 
115 120 125 

Arg Ser Glu Phe Glu Glu Thr Phe Gly Val Pro Leu He Asp Ala Tyr 
130 135 140 

Gly Ser Thr Glu Thr Cys Gly Ala He Thr Met Asn Pro Pro Asp Gly 
145 150 155 160 

Ala Arg Val Glu Gly Ser Cys Gly Leu Ala Val Pro Gly Val Asp Val 
165 170 175 

Arg Val Val Asp Pro Asp Thr Gly Leu Asp Val Pro Ala Gly Glu Glu 
180 185 190 

Gly Glu Val Trp Val Ser Gly Pro Asn Val Met Leu Gly Tyr His Asn 
195 200 205 

Ser Pro Glu Ala Thr Ala Ala Ala Met Arg Asp Gly Trp Phe Arg Thr 
210 215 220 

Gly Asp Leu Ala Arg Arg Asp Asp Ala Gly Tyr Phe Thr He Cys Gly 
225 230 235 240 

Arg He Lys Glu Leu He He Arg Gly Gly Ala Asn He His Pro Gly 
245 250 255 

Glu Val Glu Ala Val Leu Arg Thr Val Asp Gly Val Ala Asp Ala Ala 
260 265 270 
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Val Gly Gly Val Pro His Asp Thr Leu Gly Glu Val Pro Val Ala Tyr 
275 280 285 

Val lie Pro Gly Pro Thr Gly Phe Asp Pro Ala Ala Leu He Glu Lys 
290 295 300 

Cys Arg Glu Gin Leu Ser Ala Tyr Lys Val Pro Asp Arg He Leu Glu 
305 310 315 320 

Val Ala His He Pro Arg Thr Ala Ser Gly Lys He Arg Arg Gly Leu 
325 330 335 

Leu Thr Asp Glu Pro Ala Gin Leu Arg Tyr Ala Ala Thr Glu His Glu 
340 345 350 

Glu Gin Ser Arg His Ala Asp Glu Ser Val Ala Ala Ala Leu Arg Ala 
355 360 365 

Arg Leu Ser Gly Leu Asp Glu Arg Ala Gin Cys Glu Leu Leu Glu Asp 
370 375 380 

Leu Val Arg Thr Gin Ala Ala Asp Val Leu Gly Gin Pro Val Pro Asp 
385 390 595 400 

Gly Arg Ala Phe Arg Asp Leu Gly Phe Thr Ser Leu Ala He Val Glu 
405 410 415 

Leu Arg Asn Arg Leu Thr Glu His Thr Gly Leu Trp Leu Pro Ala Ser 
420 425 430 

Ala Val Phe Asp His Pro Thr Pro Ala Ala Leu Ala Ala Arg Val Arg 
435 440 445 



Ala Glu Leu Leu Gly He Thr Gin Ala Val Ala Glu Pro Val Val Ala 
450 455 460 



Ala Asp Pro Gly Glu Pro He Ala He Val Gly Met Ala Cys Arg Leu 
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465 



470 



475 



480 



Pro Gly Gly Val Ala Ser Pro Glu Asp Leu Trp Arg Leu Val Ala Glu 
485 490 495 



Arg Val Asp Ala Val Ser Glu Phe Pro Gly Asp Arg Gly Trp Asp Leu 
500 505 510 

Asp Ser Leu He Asp Pro Asp Arg Glu Arg Ala Gly Thr Ser Tyr Val 
515 520 525 

Gly Gin Gly Gly Phe Leu His Asp Ala Gly Glu Phe Asp Ala Gly Phe 
530 535 540 

Phe Gly He Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg 
545 550 555 560 

Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Asn Ala Gly Val Asp 
565 570 575 



Pro He Ala Leu Lys Gly Thr Asp Thr Gly Val Phe Ser Gly Leu Met 
580 585 590 

Gly Gin Gly Tyr Gly Ser Gly Ala Val Ala Pro Glu Leu Glu Gly Phe 
5S5 600 605 



Val Thr Thr Gly Val Ala Ser Ser Val Ala Ser Gly Arg Val Ser Tyr 
610 615 620 

Val Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser 
625 630 635 640 

Ser Ser Leu Val Ala Met His Leu Ala Ala Gin Ala Leu Arg Gin Gly 
645 650 655 



Glu Cys Ser Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro 
660 665 670 
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Gly Ser Phe Val Glu Phe Ser Arg Gin Arg Ala Leu Ala Pro Asp Gly 
675 680 685 

Arg Cys Lys Ala Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Ser Glu 
690 695 700 

Gly Val Gly Val Val Val Leu Glu Arg Leu Ser Val Ala Axg Glu Arg 
705 710 715 720 

Gly His Arg He Leu Ala Val Leu Arg Gly Ser Ala Val Asn Gin A^p 
725 730 735 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Leu Ser Gin Gin Arg 
740 745 750 

Val He kxg Arg Ala Leu Ala Ala Ala Gly Leu Ala Pro Ser Asp Val 
755 760 765 

Asp Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro He 
770 775 780 

Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg Lys Gin Pro 
785 790 795 800 

Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala 
805 810 815 

Ala Gly Val Ala Gly Val He Lys Met Val Gin Ala Leu Arg His Glu 
820 825 830 



Thr Leu Pro Pro Thr Leu His Val Asp Lys Pro Thr Leu Glu Val Asp 
835 840 845 



Trp Ser Ala Gly Ala He Glu Leu Leu Thr Glu Ala Arg Ala Trp Pro 
850 855 860 
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Arg Asn Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Val Ser 
865 870 875 880 

Gly Thr Asn Ala His Leu He Leu Glu Glu Ala Pro Ala Glu Glu Pro 
885 890 895 

Val Ala Ala Pro Glu Leu Pro Val Val Pro Leu Val Val Ser Ala Arg 
900 905 910 

Ser Thr Glu Ser Leu Ser Gly Gin Ala Glu Arg Leu Ala Ser Leu Leu 
915 920 925 

Glu Gly Asp Val Ser Leu Thr Glu Val Ala Gly Ala Leu Val Ser Arg 
930 935 940 

Arg Ala Val Leu Asp Glu Arg Ala Val Val Val Ala Gly Ser Arg Glu 
945 950 955 960 

Glu Ala Val Thr Gly Leu Arg Ala Leu Asn Thr Ala Gly Ser Gly Thr 
965 970 975 

Pro Gly Lys Val Val Trp Val Phe Pro Gly Gin Gly Thr Gin Trp Ala. 
980 985 990 

Gly Met Gly Arg Glu Leu Leu Ala Glu Ser Pro Val Phe Ala Glu Arg 
995 1000 1005 

He Ala Glu Cys Ala Ala Ala Leu Ala Pro Trp He Asp Trp Ser Leu 
1010 1015 1020 

Val Asp Val Leu Arg Gly Glu Gly Asp Leu Gly Arg Val Asp Val Leu 
1025 1030 1035 1040 

Gin Pro Ala Cys Phe Ala Val Met Val Gly Leu Ala Ala Val Trp Glu 
1045 1050 1055 



Ser Val Gly Val Arg Pro Asp Ala Val Val Gly His Ser Gin Gly Glu 
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1060 1065 1070 

lie Ala Ala Ala Cys Val Ser Gly Ala Leu Ser Leu Glu Asp Ala Ala 
1075 1080 1085 

Lys Val Val Ala Leu Arg Ser Gin Ala He Ala Ala Glu Leu Ser Gly 
1090 1095 1100 

Arg Gly Gly Met Ala Ser Val Ala Leu Gly Glu Asp Asp Val Val Ser 
1105 1110 1115 1120 

Arg Leu Val Asp Gly Val Glu Val Ala Ala Val Asn Gly Pro Ser Ser 
1125 1130 1135 

Val Val He Ala Gly. Asp Ala His Ala Leu Asp Ala Thr Leu Glu He 
1140 1145 1150 

Leu Ser Gly Glu Gly He Arg Val Arg Arg Val Ala Val Asp Tyr Ala 
1155 1160 1165 

Ser His Thr Arg His Val Glu Asp He Arg Asp Thr Leu Ala Glu Thr 
1170 1175 1180 

Leu Ala Gly He Ser Ala Gin Ala Pro Ala Val Pro Phe Tyr Ser Thr 
1185 1190 1195 1200 

Val Thr Ser Glu Trp Val Arg Asp Ala Gly Val Leu Asp Gly Gly Tyr 
1205 1210 1215 

Trp Tyr Arg Asn Leu Ajcg Asn Gin Val Arg Phe Gly Ala Ala Ala Thr 
1220 1225 1230 

Ala Leu Leu Glu Gin Gly His Thr Val Phe Val Glu Val Ser Ala His 
1235 1240 1245 



Pro Val Thr Val Gin Pro Leu Ser Glu Leu Thr Gly Asp Ala He Gly 
1250 1255 1260 
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Thr Leu Arg Arg Glu Asp Gly Gly Leu Arg Arg Leu Leu Ala Ser Met 
1265 1270 1275 1280 

Gly Glu Leu Phe Val Arg Gly He Asp Val Asp Trp Thr Ala Met Val 
1285 1290 1295 

Pro Ala Ala Gly Trp Val Asp Leu Pro Thr Tyr Ala Phe Glu His Arg 
1300 1305 1310 

His Tyr Trp Leu Glu Pro Ala Glu Pro Ala Ser Ala Gly T^p Pro Leu 
1315 1320 1325 

Leu Gly Thr Val Val Ser Thr Pro Gly Ser Asp Arg Leu Thr Ala Val 
1330 1335 1340 

Ala Gin Trp Ser Arg Arg Ala Gin Pro Trp Ala Val Asp Gly Leu Val 
1345 1350 1355 1360 

Pro Asn Ala Ala Leu Val Glu Ala Ala He Arg Leu Gly Asp Leu Ala 
1365 1370 1375 

Gly Thr Pro Val Val Gly Glu Leu Val Val Asp Ala Pro Val Val Leu 
1330 1385 1390 

Pro Arg Arg Gly Ser Arg Glu Val Gin Leu He Val Gly Glu Pro Gly 
1395 1400 1405 

Glu Gin Arg Aj:g Arg Pro lie Glu Val Phe Ser Arg Glu Ala Asp Glu 
1410 1415 1420 

Pro Trp Thr Arg His Ala His Gly Thr Leu Ala Pro Ala Ala Ala Ala 
1425 1430 1435 1440 



Val Pro Glu Pro Ala Ala Ala Gly Asp Ala Thr Asp Val Thr Val Ala 
1445 1450 1455 
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Gly Leu Arg Asp Ala Asp Arg Tyr Gly lie His Pro Ala Leu Leu Asp 
1460 1465 1470 

Ala Ala Val Arg Thr Val Val Gly Asp Asp Leu Leu Pro Ser Val Trp 
1475 1480 1485 

Thr Gly Val Ser Leu Leu Ala Ser Gly Ala Thr Ala Val Thr Val Thr 
1490 1495 1500 

Pro Thr Ala Thr Gly Leu Arg Leu Thr Asp Pro Ala Gly Gin Pro Val 
1505 1510 1515 1520 

Leu Thr Val Glu Ser Val Arg Gly Thr Pro Phe Val Ala Glu Gin Gly 
1525 1530 1535 

Thr Thr Ajsp Ala Leu Phe Arg Val Asp Trp Pro Glu lie Pro Leu Pro 
1540 1545 1550 

Thr Ala Glu Thr Ala Asp Phe Leu Pro Tyr Glu Ala Thr Ser Ala Glu 
1555 1560 1565 

Ala Thr Leu Ser Ala Leu Gin Ala Trp Leu Ala Asp Pro Ala Glu Thr 
1570 1575 1580 

Arg Leu Ala Val Val Thr Gly Asp Cys Thr Glu Pro Gly Ala Ala Ala 
1585 1590 1595 1600 

He Trp Gly Leu Val Arg Ser Ala Gin Ser Glu His Pro Gly Arg He 
1605 1610 1615 

Val Leu Ala Asp Leu Asp Asp Pro Ala Val Leu Pro Ala Val Val Ala 
1620 1625 1630 

Ser Gly Glu Pro Gin Val Arg Val Arg Asn Gly Val Ala Ser Val Pro 
1635 1640 1645 



Arg Leu Thr Arg Val Thr Pro Arg Gin Asp Ala Arg Pro Leu Asp Pro 
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1650 1655 1660 



Glu Gly Thr Val Leu lie Thr Gly Gly Thr Gly Thr Leu Gly Ala Leu 
1665 1670 1675 1680 

Thr Ala Arg His Leu Val Thr Ala His Gly Val Arg His Leu Val Leu 
1685 1690 1695 

Val Ser Arg Arg Gly Glu Ala Pro Glu Leu Gin Glu Glu Leu Thr Ala 
1700 1705 1710 

Leu Gly Ala Ser Val Ala lie Ala Ala Cys Asp Val Ala Asp Arg Ala 
1715 1720 1725 

Gin Leu Glu Ala Val Leu Arg Ala lie Pro Ala Glu His Pro Leu Thr 
1730 1735 1740 

Ala Val He His Thr Ala Gly Val Leu Asp Asp Gly Val Val Thr Glu 
1745 1750 1755 1760 

Leu Thr Pro Asp Arg Leu Ala Thr Val Arg Arg Pro Lys Val Asp Ala 
1765 1770 1775 

Ala Arg Leu Leu Asp Glu Leu Thr Arg Glu Ala Asp Leu Ala Ala Phe 
1780 1785 1790 

Val Leu Phe Ser Ser Ala Ala Gly Val Leu Gly Asn Pro Gly Gin Ala 
1795 1800 1305 

Gly Tyr Ala Ala Ala Asn Ala Glu Leu Asp Ala Leu Ala Arg Gin Arg 
1810 1815 1820 

Asn Ser Leu Asp Leu Pro Ala Val Ser He Ala Trp Gly Tyr Trp Ala 
1825 1830 1835 1840 



Thr Val Ser Gly Met Thr Glu His Leu Gly Asp Ala Asp Leu Arg Arg 
1845 1850 1855 
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Asn Gin Arg He Gly Met Ser Gly Leu Pro Ala Asp Glu Gly Met Ala 
1860 1865 1870 

Leu Leu Asp Ala Ala He Ala Thr Gly Gly Thr Leu Val Ala Ala Lys 
1875 1880 1885 

Phe Asp Val Ala Ala Leu Arg Ala Thr Ala Lys Ala Gly Gly Pro Val 
1890 1895 1900 

Pro Pro Leu Leu Arg Gly Leu Ala Pro Leu Pro Arg Arg Ala Ala Ala 
1905 1910 1915 1920 

Lys Thr Ala Ser I^eu Thr Glu Arg Leu Ala Gly Leu Ala Glu Thr Glu 
1925 1930 1925 

Gin Ala Ala Ala Leu Leu Asp Leu Val Arg Arg His Ala Ala Glu Val 
1940 1945 1950 

Leu Gly His Ser Gly Ala Glu Ser Val His Ser Gly Arg Thr Phe Lys 
1955 1960 1965 

Asp Ala Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu 
1970 1975 1980 

Ala Ala Ala Thr Gly Leu Thr Leu Ser Pro Ala Met He Phe Asp Tyr 
1985 1990 1995 2000 

Pro Lys Pro Pro Ala Leu Ala Asp His Leu Arg Ala Lys Leu Phe Gly 
2005 2010 2015 

Ser Ala Ala Asn Arg Pro Ala Glu He Gly Thr Ala Ala Ala Glu Glu 
2020 2025 2030 



Pro He Ala He Val Ala Met Ala Cys Arg Phe Pro Gly Gly Val His 
2035 2040 2045 
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Ser Pro Glu Asp Leu Trp Arg Leu Val Ala Asp Gly Ala Asp Ala Val 
2050 2055 2060 

Thr Glu Phe Pro Ala Asp Arg Gly Trp Asp Thr Asp Arg Leu Tyr His 
2065 2070 2075 2080 

Glu Asp Pro Asp His Glu Gly Thr Thr Tyr Val Arg His Gly Ala Phe 
2085 2090 2095 

Leu Asp Asp Ala Ala Gly Phe Asp Ala Ala Phe Phe Gly lie Ser Pro 
2100 2105 2110 

Asn Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr 
2115 2120 2125 

Ser Trp Glu Leu Phe Glu Arg Ala Ala He Asp Pro Thr Thr Leu Ala 
2130 2135 2140 

Gly Gin Asp He Gly Val Phe Ala Gly Val Asn Ser His Asp Tyr Ser 
2145 2150 2155 2160 

Met Arg Met His Arg Ala Ala Gly Val Glu Gly Phe Arg Leu Thr Gly 
2165 2170 2175 

Gly Ser Ala Ser Val Leu Ser Gly Arg Val Ala Tyr His Phe Gly Val 
2180 2185 2190 

Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val 
2195 2200 2205 

Ala Leu His Met Ala Val Gin Ala Leu Gin Arg Gly Glu Cys Ser Met 
2210 2215 2220 

Ala Leu Ala Gly Gly Val Met Val Met Gly Thr Val Glu Thr Phe Val 
2225 2230 2235 2240 

Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala 
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2245 2250 2255 

Phe Ala Asp Gly Ala Asp Gly Thr Gly Trp Ser Glu Gly Val Gly Leu 
2260 2265 2270 

Leu Leu Val Glu Arg Leu Ser Glu Ala Gin Arg Arg Gly His Gin Val 
2275 2280 2285 

Leu Ala Val Val Arg Gly Ser Ala Val Asn Ser Asp Gly Ala Ser Asn 
2290 2295 2300 

Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val lie A-rg Lys 
2305 2310 2315 2320 

Ala Leu Ala Ala Ala Gly Leu Ser Thr Ser Asp Val Asp Ala Val Glu 
2325 2330 2335 

Ala His Gly Thr Gly Thr Thr Leu Gly Asp Pro lie Glu Ala Glu Ala 
2340 2345 2350 

Leu Leu Ala Thr Tyr Gly Gin Asn Arg Glu Thr Pro Leu Trp Leu Gly 
2355 2360 2365 

Ser Val Lys Ser Asn Leu Gly His Thr Glh Ala Ala Ala Gly Val Ala 
2370 2375 2380 

Gly Val He Lys Met Val Met Ala Met Arg His Gly Val Leu Pro Arg 
2385 2390 2395 2400 

Thr Leu His Val Asp Arg Pro Ser Ser Tyr Val Asp Trp Ser Ala Gly 
2405 2410 2415 

AJLa Val Glu Leu Leu Thr Glu Ala Arg Asp Trp Val Ser Asn Gly His 
2420 2425 2430 



Pro Arg Arg Ala Gly Val Ser Ser Phe Gly He Gly Gly Thr Asn Ala 
2435 2440 2445 
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His Val Val Leu Glu Glu Val Ala Ala Pro He Thr Thr Pro Gin Pro 
2450 2455 2460 

Glu Pro Ala Glu Phe Leu Val Pro Val Leu Val Ser Ala Arg Thr Ala 
2465 2470 2475 2480 

Ala Gly Leu Arg Gly Gin Ala Gly Arg Leu Ala Ala Phe Leu Gly Asp 
2485 2490 2495 

Arg Thr Asp Val Arg Val Pro Asp Ala Ala Tyr Ala Leu Ala Thr Thr 
2500 2505 2510 

Arg Ala Gin Leu Asp His Arg Ala Val Val Leu Ala Ser Asp Arg Ala 
2515 2520 2525 

Gin Leu Cys Ala Asp Leu Ala Ala Phe Gly Ser Gly Val Val Thr Gly 
2530 2535 2540 

Thr Pro Val Asp Gly Lys Leu Ala Val Leu Phe Thr Gly Gin Gly Ser 
2545 2550 2555 2560 

Gin Trp Ala Gly Met Gly Arg Glu Leu Ala Glu Thr Phe Pro Val Phe 
2565 2570 2575 

Arg Asp Ala Phe Glu Ala Ala Cys Glu Ala Val Asp Thr His Leu Arg 
2580 2585 2590 

Glu Arg Pro Leu Arg Glu Val Val Phe Asp Asp Ser Ala Leu Leu Asp- 
2595 2600 2605 

Gin Thr Met Tyr Thr Gin Gly Ala Leu Phe Ala Val Glu Thr Ala Leu 
2610 2615 2620 



Phe Arg Leu Phe Glu Ser Trp Gly Val Arg Pro Gly Leu Leu Ala Gly 
2625 2630 2635 2640 
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His Ser lie Gly Glu Leu Ala Ala Ala His Val Ser Gly Val Leu Asp 
2645 2650 2655 

Leu Ala Asp Ala Gly Glu Leu Val Ala Ala Arg Gly Arg Leu Met Gin 
2660 2665 2670 

Ala Leu Pro Ala Gly Gly Ala Met Val Ala Val Gin Ala Thr Glu Asp 
2675 2680 2685 

Glu Val Ala Pro Leu Leu Asp Gly Thr Val Cys Val Ala Ala Val Asn 
2690 2695 2700 

Glv Pro Asp Ser Val Val Leu Ser Gly Thr Glu Ala Ala Val Leu Ala 
2705 2710 2715 2720 

Val Ala Asp Glu Leu Ala Gly Arg Gly Arg Lys Thr Arg Arg Leu Ala 
2725 2730 2735 

Val Ser His Ala Phe His Ser Pro Leu Met Glu Pro Met Leu T^p Asp 
2740 2745 2750 

Phe Arg Ala Val Ala Glu Arg Leu Thr Tyr Arg Ala Gly Ser Leu Pro 
2755 2760 2765 

Val Val Ser Thr Leu Thr Gly Glu Leu Ala Ala Leu Asp Ser Pro Asp 
2770 2775 2780 

Tyr Trp Val Gly Gin Val Arg Asn Ala Val Arg Phe Ser Asp Ala Val 
2785 2790 2795 2800 

Thr Ala Leu Gly Ala Gin Gly Ala Ser Thr Phe Leu Glu Leu Gly Pro 
2605 2810 2815 

Gly Gly Ala Leu Ala Ala Met Ala Leu Gly Thr Leu Gly Gly Pro Glu 
2820 2825 2830 



Gin Ser Cys Val Ala Thr Leu Arg Lys Asn Gly Ala Glu Val Pro Asp 
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2835 2840 2845 

Val Leu Thr Ala Leu Ala Glu Leu His Val Arg Gly Val Gly Val Asp 
2850 2855 2860 

Trp Thr Thr Val Leu Asp Glu Pro Ala Thr Ala Val Gly Thr Val Leu 
2865 2870 2875 2880 

Pro Thr Tyr Ala Phe Gin His Gin Arg Phe Trp Val Asp Val Asp Glu 
2885 2890 2895 

Thr Ala Ala Val Ser Val Thr Pro Pro Pro Ala Glu Pro lie Val Asp 
2900 2905 2910 

Arg Pro Val Gin Asp Val Leu Glu Leu Val Arg Glu Ser Ala Ala Val 
2915 2920 2925 

Val Leu Gly His Arg Asp Ala Gly Ser Phe Asp Leu Asp Arg Ser Phe 
2930 2935 2940 

Lys Asp His Gly Phe Asp Ser Leu Ser Ala Val Lys Leu Arg Asn Arg 
2945 2950 2955 2960 

Leu Arg Asp Phe Thr Gly Val Glu Leu Pro Ser Thr Leu He Phe Asp 
2965 2970 2975 

Tyr Pro Asn Pro Ala Val Leu Ala Asp His Leu Arg Ala Glu Leu Leu 
2980 2985 2990 

Gly Glu Arg Pro Ala Ala Pro Ala Pro Val Thr Arg Asp Val Ser Asp 
2995 3000 3005 

Glu Pro He Ala He Val Gly Met Ser Thr Arg Leu Pro Gly Gly Ala 
3010 3015 3020 

Asp Ser Pro Glu Glu Leu Trp Lys L«u Val Ala Glu Gly Arg Asp Ala 
3025 3030 3035 3040 
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Val Ser Gly Phe Pro Val Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr 
3045 3050 3055 

His Pro Asp Pro Ala His Ala Gly Thr Ser Tyr Thr Arg Ser Gly Gly 
3060 3065 3070 

Phe Leu His Asp Ala Ala Gin Phe Asp Ala Gly Leu Phe Gly He Ser 
3075 3080 3085 

Pro Arg Glu Ala Leu Rla Met Asp Pro Gin Gin Arg Leu Leu Leu Glu 
30S0 3095 3100 

Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Val Asp Pro Leu Ser Ala 
3105 3110 3115 3120 

Arg Gly Ser Asp Val Gly Val Phe Thr Gly He Val His His Asp Tyr 
3125 3130 3135 

Val Thr Arg Leu Arg Glu Val Pro Glu Asp Val Gin Gly Tyr Thr Met 
3140 3145 3150 

Thr Gly Thr Ala Ser Ser Val Ala Ser Gly Arg Val Ala Tyr Val Phe 
3155 3160 3165 

Gly Phe Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser 
3170 3175 3180 

Leu Val Ala Met His Leu Ala Ala Gin Ala Leu Arg Gin Gly Glu Cys 
3185 3150 3195 3200 

Ser Met Ala Leu Ala Gly Gly Ala Thr Val Met Ala Ser Pro Asp Ala 
3205 3210 3215 

Phe Leu Glu Phe Ser Arg Gin Arg Gly Leu Ser Ala Asp Gly Arg Cys 
3220 3225 3230 
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Lys Ala Tyr Ala Glu Gly Ala Asp Gly Thr Gly Trp Ala Glu Gly Val 
3235 3240 3245 

Gly Val val Val Leu Glu Arg Leu Ser Val Ala Arg Glu Arg Gly His 
.3250 3255 3260 

Arg val Leu Ala Val Leu Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
3265 3270 3275 3280 

Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val He 
3285 3290 3295 

Arg Gly Ala Leu Ala Ser Ala Gly Leu Ala Pro Ser Asp Val Asp Val 
3300 3305 3310 

val Glu Gly His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Val 
3315 3320 3325 

Gin Ala Leu Leu Ala Thr Tyr Gly Gin Glu Arg Glu Gin Pro Leu Trp 
3330 3335 3340 

Leu Gly Ser Leu Lys Ser Asn Leu Gly His Thr Gin Ala Ala Ala Gly 
3345 3350 3355 3360 

Val Val. Gly Val He Lys Met He Met Ala Met Arg His Gly Val Met 
3365 3370 3375 

Pro Ala Thr Leu His Val Asp Glu Arg Thr Ser Gin Val Asp Trp Ser 
3380 3385 3390 

Ala Gly Ala He Glu Val Leu Thr Glu Ala Arg Glu Trp Pro Arg Thr 
3395 3400 3405 

Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr 
3410 3415 3420 

Asn Ala His Leu He He Glu Glu Gly Pro Ala Glu Glu Ala Val Asp 
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3425 . 3430 3435 3440 

Glu Glu Val Ala Ser Val Val Pro Leu Val Val Ser Ala Arg Ser Ala 
3445 3450 3455 

Gly Ser Leu Ala Gly Gin Ala Gly Arg Leu Ala Ala Val Leu Glu Asn 
3460 3465 3470 

Glu Ser Leu Ala Gly Val Ala Gly Ala Leu Val Ser Gly Arg Ala Thr 
3475 3480 3485 

Leu Asn Glu Arg Ala Val Val lie Ala Gly Ser Arg Asp Glu Ala Gin 
3490 3495 3500 

Asp Gly Leu Gin Ala Leu Ala Arg Gly Glu Asn Ala Pro Gly Val Val 
3505 3510 3515 3520 

Thr Gly Thr Ala Gly Lys Pro Gly Lys Val Val Trp Val Phe Pro Gly 
3525 3530 3535 

Gin Gly Ser Gin Trp Met Gly Met Gly Arg Asp Leu Leu Asp Ser Ser 
3540 3545 3550 

Pro Val Phe Ala Ala Axg lie Lys Glu Cys Ala Ala Ala Leu Glu Gin 
3555 3560 3565 

Trp Thr Asp Trp Ser Leu Leu Asp Val Leu Arg Gly Asp Ala Asp Leu 
3570 3575 3580 

Leu Asp Arg Val Asp Val Val Gin Pro Ala Ser Phe Ala Met Met Val 
35S5 3590 3595 3600 

Gly Leu Ala Ala Val Trp Thr Ser Leu Gly Val Thr Pro Asp Ala Val 
3605 3610 3615 

Leu Gly His Ser Gin Gly Glu He Ala Ala Ala Cys Val Ser Gly Ala 
3620 3625 3630 
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Leu Ser Leu Asp Asp Ala Ala Lys Val Val Ala Leu Arg Ser Gin Ala 
3635 3640 3645 

He Ala Gly Glu Leu Ala Gly Arg Gly Gly Met Ala Ser Val Ala Leu 
3650 3655 3660 

Ser Glu Glu Asp Ala Val Ala Arg Leu Thr Pro Trp Ala Asn Arg Val 
3665 3670 3675 3680 

Glu Val Ala Ala Val Asn Ser Pro Ser Ser Val Val He Ala Gly Asp 
3685 3690 3695 

Ala Gin Ala Leu Asp Glu Ala Leu Glu Ala Leu Ala Gly A-sp Gly Val 
3700 3705 3710 

Arg Val Arg Arg Val Ala Val Asp Tyr Ala Ser His Thr Arg His Val 
3715 3720 3725 

Glu Ala He Ala Glu Thr Leu Ala Lys Thr Leu Ala Gly He Asp Ala 
3730 3735 3740 

Arg Val Pro Ala He Pro Phe Tyr Ser Thr Val Leu Gly Thr Trp He 
3745 3750 3755 3760 

Glu Gin Ala Val Val Asp Ala Gly Tyr Trp Tyr Arg Asn Leu Arg Gin 
3765 3770 3775 

Gin Val Arg Phe Gly Pro Ser Val Ala Asp Leu Ala Gly Leu Gly His 
3780 3795 3790 

Thr Val Phe Val Glu He Ser Ala His Pro Val Leu Val Gin Pro Leu 
3795 3800 3805 

Ser Glu lie Ser Asp Asp Ala Val Val Thr Gly Ser Leu Arg Arg Asp 
3810 3815 3820 
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Asp Gly Gly Leu Arg Arg Leu Leu Ala Ser Ala Ala Glu Leu Tyr Val 
3825 3830 3835 3840 

Arg Gly Val Ala Val Asp Trp Thr Ala Ala Val Pro Ala Ala Gly Trp 
3845 3850 3855 

Val Asp Leu Pro Thr Tyr Ala Phe Asp Arg Arg His Phe Trp Leu His 
3860 3865 3870 

Glu Ala Glu Thr Ala Glu Ala Ala Glu Gly Met Asp Gly Glu Phe Trp 
3875 3880 3885 

Thr Ala lie Glu Gin Ser Asp Val Asp Ser Leu Ala Glu Leu Leu Glu 
3890 3895 3900 

Leu Val Pro Glu Gin Axg Gly Ala Leu Ser Thr Val Val Pro Val Leu 
3905 3910 3915 3920 

Ala Gin Trp Arg Asp Arg Arg Arg Glu Arg Ser Thr Ala Glu Lys Leu 
3925 3930 3935 

Arg Tyr Gin Val Thr Trp Gin Pro Leu Glu Arg Glu Ala Ala Gly Val 
3940 3945 3950 

Pro Gly Gly Arg Trp Leu Ala Val Val Pro Ala Gly Thr Thr Asp Ala 
3955 3960 3965 

Leu Leu Lys Glu Leu Thr Gly Gin Gly Leu Asp He Val Arg Leu Glu 
3970 3975 3980 

He Glu Glu Ala Ser Arg Ala Gin Leu Ala Glu Gin Leu Arg Asn Val 
3985 3990 3995 4000 

Leu Ala Glu His Asp Leu Thr Gly Val Leu Ser Leu Leu Ala Leu Asp 
4005 4010 4015 

Gly Gly Pro Ala Asp Ala Ala Glu He Thr Ala Ser Thr Leu Ala Leu 



4020 4025 4030 

Val Gin Ala Leu Gly Asp Thr Thr Thr Ser Ala Pro Leu Trp Cys Leu 
4035 4040 4045 



Thr Ser Gly Ala Val Asn He Gly He Gin Asp Ala Val Thr Ala Pro 
4050 4055 4060 

Ala Gin Ala Ala Val Trp Gly Leu Gly Arg Ala Val Ala Leu Glu Arg 
4065 4070 4075 4080 

Leu Asp Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ala He Asp Ala 
4085 4090 4095 

Arg Thr Ala Gin Ala Leu Leu Gly Val Leu Asn Gly Ala Ala Gly Glu 
4100 4105 4110 

Asp Gin Leu Ala Val Arg Arg Ser Gly Val Tyr Arg Arg Arg Leu Val 
4115 4120 4125 

Arg Lys Pro Val Pro Glu Ser Ala Thr Ser Arg Trp Glu Pro Arg Gly 
4130 4135 4140 

Thr Val Leu Val Thr Gly Gly Ala Glu Gly Leu Gly Arg His Ala Ser 
4145 4150 4155 4160 

Val Trp Leu Ala Gin Ser Gly Ala Glu Arg Leu He Val Thr Gly Thr 
4165 4170 4175 

Asp Gly Val Asp Glu Leu Thr Ala Glu Leu Ala Glu Phe Gly Thr Thr 
4180 4185 4190 

Val Glu Phe Cys Ala Asp Thr Asp Arg Asp Ala He Ala Gin Leu Val 
4195 4200 4205 



Ala Asp Ser Glu Val Thr Ala Val Val His Ala Ala Asp He Ala Gin 
4210 4215 4220 



-124- 



Thr Ser Ser Val Asp Asp Thr Gly .Val Ala Asp Leu Asp Glu Val Phe 
4225 4230 4235 4240 

Ala Ala Lys Val Thr Thr Ala Val Trp Leu Asp Gin Leu Phe Glu Asp 
4245 4250 4255 

Thr Pro Leu Asp Ala Phe Val Val Phe Ser Ser He Ala Gly He Trp 
4260 4265 4270 

Gly Gly Gly Gly Gin Gly Pro Ala Gly Ala Ala Asn Ala Val Leu Asp 
4275 4280 4285 

Ala Leu Val Glu Trp Arg Arg Ala. Arg Gly Leu Lys Ala Thr Ser He 
4290 4295 4300 

Ala Trp Gly Ala Leu Asp Gin He Gly He Gly Met Asp Glu Ala Ala 
4305 4310 4315 4320 

Leu Ala Gin Leu Arg Arg Arg Gly Val He Pro Met Ala Pro Pro Leu 
4325 4330 4335 

Ala Val Thr Ala Met Val Gin Ala Val Ala Gly Asn Glu Lys Ala Val 
4340 4345 4350 

Ala Val Ala Asp Met Asp Trp Ala Ala Phe He Pro Ala Phe Thr Ser 
4355 4360 4365 

Val Arg Pro Ser Pro Leu Phe Ala Asp Leu Pro Glu Ala Lys Ala He 
4370 4375 4380 

Leu Arg Ala Ala Gin Asp Asp Gly Glu Asp Gly Asp Thr Ala Ser Ser 
4385 4390 4395 4400 

Leu Ala Asp Ser Leu Arg Ala Val Pro Asp Ala Glu Gin Asn Arg He 
4405 4410 4415 
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Leu Leu Lys Leu Val Arg Gly His Ala Ser Thr Val Leu Gly His Ser 
4420 4425 4430 

Gly Ala Glu Gly He Gly Pro Arg Gin Ala Phe Gin Glu Val Gly Phe 
4435 4440 4445 

Asp Ser Leu Ala Ala Val Asn Leu Arg Asn Ser Leu His Ala Ala Thr 
4450 4455 4460 

Gly Leu Arg Leu Pro Ala Thr Leu He Phe Asp Tyr Pro Thr Pro Glu 
4465 4470 4475 4480 

p Ala Leu Val Gly Tyr Leu Arg Val Glu Leu Leu Arg Glu Ala Asp Asp 

□ 4485 4490 4495 

fij 

01 Gly Leu Asp Gly Arg Glu Asp Asp Leu Arg Arg Val Leu Ala Ala Val 

4500 . 4505 4510 

Pro Phe Ala Arg Phe Lys Glu Ala Gly Val Leu Asp Thr Leu Leu Gly 
4515 4520 4525 



ass; 

CJ. 



Leu Ala Asp Thr Gly Thr Glu Pro Gly Thr Asp Ala Glu Thr Thr Glu 
4530 4535 4540 

Ala Ala Pro Ala Ala Asp Asp Ala Glu Leu He Asp Ala Leu Asp He 
4545 4550 4555 4560 

Ser Gly Leu Val Gin Arg Ala Leu Gly Gin Thr Ser 
4555 4570 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5069 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Asn Gin Ser Trp Arg Lys Asn Met Ser Ala Pro Asn Glu Gin 
15 10 15 

lie Val Asp Ala Leu Arg Ala Ser Leu Lys Glu Asn Val Arg Leu Gin 
20 25 30 

Gin Glu Asn Ser Ala Leu Ala Ala Ala Ala Ala Glu Pro Val Ala He 
35 40 45 

Val Ser Met Ala Cys Arg Tyr Ala Gly Gly He Arg Gly Pro Glu Asp 
50 55 60 

Phe Trp Arg Val Vjal Ser Glu Gly Ala Asp Val Tyr Thr Gly Phe Pro 
65 70 75 80 

Glu Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr His Pro Asp Pro Asp 
85 90 95 

Asn Pro Gly Thr Thr Tyr Val Arg Glu Gly Ala Phe Leu Gin Asp Ala 
100 105 110 

Ala Gin Phe Asp Ala Gly Phe Phe Gly He Ser Pro Arg Glu Ala Leu 
115 120 125 

Ala Met Asp Pro Gin Gin Arg Gin Leu Leu Glu Val Ser Trp Glu Thr 
130 135 140 



Leu Glu Arg Ala Gly He Asp Pro His Ser Val Arg Gly Ser Asp He 
145 150 155 160 
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Gly Val Tyr Ala Gly Val Val His Gin Asp Tyr Ala Pro Asp Leu Ser 
165 170 175 

Gly Phe Glu Gly Phe Met Ser Leu Glu Arg Ala Leu Gly Thr Ala Gly 
180 185 190 

Gly Val Ala Ser Gly Arg Val AJLa Tyr Thr Leu Gly Leu Glu Gly Pro 
195 200 205 

Ala Val Thr Val Asp Thr Met Cys Ser Ser Ser Leu Val Ala lie His 
210 215 220 

Leu Ala Ala Gin Ala Leu Arg Arg Gly Glu Cys Ser Met Ala Leu Ala 
225 230 235 240 

Gly Gly Ser Thr Val Met Ala Thr Pro Gly Gly Phe Val Gly Phe Ala 
245 250 255 

Arg Gin Arg Ala Leu Ala Phe Asp Gly Arg Cys Lys Ser Tyr Ala Ala 
260 265 270 

Ala Ala Asp Gly Ser Gly Trp Ala Glu Gly Val Gly Val. Leu Leu Leu 
275 280 285 

Glu Arg Leu Ser Val Ala Arg Glu Arg Gly His Gin Val Leu Ala Val 
290 295 . 300 

He Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr 
305 310 315 320 

Ala Pro Asn Gly Pro Ala Gin Gin Arc Val He Arg Lys Ala Leu Ala 
325 330 335 



Ser Ala Gly Leu Thr Pro Ser Asp Val Asp Thr Val Glu Gly His Gly 
340 345 350 
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Thr Gly Thr Val Leu Gly Asp Pro lie Glu Val Gin Ala Leu Leu Ala 
355 360 . 365 

Thr Tyr Gly Gin Gly Arg Asp Pro Gin Gin Pro Leu Trp Leu Gly Ser 
370 375 380 

val Lys Ser Val Val Gly His Thr Gin Ala Ala Ser Gly Val Ala Gly 
385 390 395 400 

Val He Lys Met Val Gin Ser Leu Arg His Gly Gin Leu Pro Ala Thr 
405 410 415 

Gin His Val Asp Ala Pro Thr Pro Gin Val Asp Trp Ser Ala Gly Ala 
420 425 430 



He Glu Leu Leu Ala Glu Gly Arg Glu Trp Pro Arg Asn Gly His Pro 
435 440 445 

Arg Arg Gly Gly He Ser Ser Phe Gly Ala Ser Gly Thr Asn Ala His 



455 



',60 



Met He Leu Glu Glu Ala Pro Glu Asp Glu Pro Val Thr Glu Ala Pro 
465 470 475 480 

Ala Pro Tb^ Gly Val Val Pro Leu Val Val Ser Ala Ala Thr Ala Ala 
485 490 495 

Ser Leu Ala Ala Gin Ala Gly Arg Leu Ala Glu Val Gly Asp Val Ser 
500 505 510 

Leu Ala Asp Val Ala Gly Thr Leu Val Ser Gly Arg Ala Met Leu Ser 
515 520 525 



Glu Arg Ala Val Val Val Ala Gly Ser His Glu Glu Ala Val Thr Gly 
530 535 540 



Leu Arg Ala Leu Ala Arg Gly Glu Ser Ala Pro Gly Leu Leu Ser Gly 
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545 



550 



555 



560 



Arg Gly Ser Gly Val Pro Gly Lys Val Val Trp Val Phe Pro Gly Gin 
565 570 575 

Gly Thr Gin Trp Ala Gly Met Gly Arg Glu Leu Leu Asp Ser Ser Glu 
580 585 590 

Val Phe Ala Ala Arg He Ala Glu Cys Glu Thr Ala Leu Gly Axg Trp 
595 600 605 

Val Asp Trp Ser Leu Thr Asp Val Leu Arg Gly Glu Ala Asp Leu Leu 
610 615 620 

Asp Arg Val Asp Val Val Gin Pro Ala Ser Phe Ala Val Met Val Gly 
625 630 635 640 

Leu Ala Ala Val Trp Ala Ser Leu Gly Val Glu Pro Glu Ala Val Val 
645 650 655 

Gly His Ser Gin Gly Glu He Ala Ala Ala Cys Val Ser Gly Ala Leu 
660 665 670 

Ser Leu Glu Asp Ala AJLa Lys Val Val Ala Leu Arg Ser Gin Ala He 
675 680 685 

Ala Ala Ser Leu Ala Gly Arg Gly Gly Met Ala Ser Val Ala Leu Ser 
690 695 700 

Glu Glu Asp Ala Thr Ala Arg Leu Glu Pro Trp Ala Gly Arg Val Glu 
705 710 715 720 



Val Ala Ala Val Asn Gly Pro Thr Ser Val Val He Ala Gly Asp Ala 
725 730 735 



Glu Ala Leu Asp Glu Ala Leu Asp Ala Leu Asp Asp Gin Gly Val Arg 
740 745 750 
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Ile Arg Arg Val Ala Val Asp Tyr Ala Ser His Thr Arg His Val Glu 
755 760 765 

Ala Ala Arg Asp Ala Leu Ala Glu Met Leu Gly Gly He A-rg Ala Gin 
770 775 780 

Ala Pro Glu Val Pro Phe Tyr Ser Thr Val Thr Gly Gly Trp Val Glu 
785 790 795 800 

Asp AJLa Gly Val Leu Asp Gly Gly Tyr Trp Tyr Arg Asn Leu P^q Aj:g 
805 810 815 

Gin Val Arg Phe Gly Pro Ala Val Ala Glu Leu He Glu Gin Gly His 
820 825 830 

Arg Val Phe Val Glu Val Ser Ala His Pro Val Leu Val Gin Pro He 
835 840 845 

Asn Glu Leu Val Asp Asp Thr Glu Ala Val Val Thr Gly Thr Leu Arg 
850 855 860 

Arg Glu Asp Gly Gly Leu Arg Arg Leu Leu Ala Ser Ala Ala Glu Leu 
865 870 875 880 

Phe Val Arg Gly Val Thr Val Asp Trp Ser Gly Val Leu Pro Pro Ser 
885 890 895 



Arg Arg Val Glu Leu Pro Thr Tyr Ala Phe Asp His Gin His Tyr Trp 
900 905 910 

Leu Gin Met Gly Gly Ser Ala Thr Asp Ala Val Ser Leu Gly Leu Ala 
915 920 925 

Gly Ala Asp His Pro Leu Leu Gly Ala Val Val Pro Leu Pro Gin Ser 
930 935 940 
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Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Leu Lys Ser His Pro Trp 
945 950 955 960 

Leu Ala Gly His Ala He Gly Gly Val Val Leu He Pro Gly Thr Val 
965 970 975 

Tyr Val Asp Leu Ala Leu Arg Ala Gly Asp Glu Leu Gly Phe Gly Val 
980 985 990 

Leu Glu Glu Leu Val He Glu Ala Pro Leu Val Leu Gly Glu Arg Gly 
995 1000 1005 

Gly Val Arg Val Gin Val Ala Val Ser Gly Pro Asn Glu Thr Gly Ser 
1010 1015 1020 

Arg Ala Val Asp Val Phe Ser Met Arg Glu Asp Gly Asp Glu Trp Thr 
1025 1030 1035 1040 

Arg His Ala Thr Gly Leu Leu Gly Ala Ser Thr Ser Arg Glu Pro Ser 
1045 1050 1055 

Ajrg Phe Asp Phe Ala Ala Trp Pro Pro Ala Gly Ala Glu Pro He Asp 
1C60 1065 1070 

Val Glu Asn Phe Tyr Thr Asp Leu Thr Glu Arg Gly Tyr Ala Tyr Ser 
1075 1080 1085 

Gly Ala Phe Gin Gly Met Arg Ala Val Trp Arc Arg Gly Asp Glu Val 
1090 1095 1100 

Phe Ala Glu Val Ala Leu Pro Asp Asp His Arg Glu Asp Ala Gly Lys 
1105 1110 1115 1120 

Phe Gly Leu His Pro Ala Leu Leu Asp Ala Ala Leu His Thr Asn Ala 
1125 1130 1135 

Phe Ala Asn Pro Asp Asp Asp Arg Ser Val Leu Pro Phe Ala Trp Asn 
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1140 1145 1150 

Gly Leu Val Leu His Ala Val Gly Ala Ser Ala Leu Arg Val Arg Val 
1155 1160 1165 

Ala Pro Gly Gly Pro Asp Ala Leu Thr Phe Gin Ala Ala Asp Glu Thr 
1170 1175 1180 

Gly Gly Leu Val Val Thr Met Asp Ser Leu Val Ser Arg Glu Val Ser 
1185 1190 1195 1200 

Ala Ala Gin Leu Glu Thr Ala Ala Gly Glu Glu Arg Asp Ser Leu Phe 
1205 1210 1215 

Gin Val Asp Trp He Glu Val Pro Ala Thr Glu Thr Ala Ala Thr Glu 
% 1220 1225 1230 

His Ala Glu Val Leu Glu Ala Phe Gly Glu Ala Ala Pro Leu Glu Leu 
. 1235 1240 1245 

Thr Ser Arg Val Leu Glu Ala Val Gin Ser Trp Leu Ala Asp Ala Ala 
1250 1255 1260 

Asp Glu Ala Arg Leu Val Val Val Thr Arg Gly Ala Val Arg Glu Val 
1265 1270 1275 1280 

Thr Asp Pro Ala Gly Ala Ala Val Trp Gly Leu Val Arg Ala Ala Gin 
1285 1290 1295 

Ala Glu Asn Pro Gly Arg He He Leu Val Asp Thr Asp Gly Asp Val 
1300 1305 1310 

Pro Leu Gly Ala Val Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg 
1315 1320 1325 



Gly Asn Ala Phe Ser Val Pro Arg Leu Ala Arg Ala Thr Gly Glu Val 
1330 1335 1340 



- 133- 



Pro Glu Ala Pro Ala Val Phe Ser Pro Glu Gly Thr Val Leu Leu Thr 
1345 1350 1355 1360 

Gly Gly Thr Gly Ser Leu Gly Gly Leu Val Ala Lys His Leu Val Ala 
1365 1370 1375 

Arg His Gly Val Arg Arg Leu Val Leu Ala Ser Arg Arg Gly Val Ala 
1380 1385 1390 

Ala Glu Asp Leu Val Thr Glu Leu Thr Glu Gin Gly Ala Thr Val Ser 
1395 1400 1405 

Val Val Ala Cys Asp Val Ser Asp Arg Asp Gin Val Ala Ala Leu Leu 
1410 1415 1420 

Ala Glu His Arg Pro Thr Gly lie Val His Leu Ala Gly Leu Leu Asp 
1425 1430 1435 1440 

Asp Gly Val He Gly Ala Leu Asn Arg Glu Arg Leu Ala Gly Val Phe 
1445 1450 1455 

Ala Pro Lys Val Asp Ala Val Gin His Leu Asp Glu Leu Thr Arg Asp 
1460 1465 1470 

Leu Gly Leu Asp Ala Phe Val Val Phe Ser Ser Ala Ala Ala Leu Met 
1475 1480 1485 

Gly Ser Ala Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp 
1490 1495 1500 

Gly Leu Met Ala Gly Arg Arg Ala Ala Gly Leu Pro Gly Val Ser Leu 
1505 1510 1515 1520 



Ala Trp Gly Leu Trp Glu Gin Ala Asp Gly Leu Thr Ala Asn Leu Ser 
1525 1530 1535 
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Ala Thr Asp Gin Ala Arg Met Ser Arg Gly Gly Val Leu Pro Met Thr 
1540 1545 1550 

Pro Ala Glu Ala Leu Asp He Phe Asp He Gly Leu Ala Ala Glu Gin 
1555 1560 1565 

Ala Leu Leu Val Pro He Lys Leu Asp Leu Arg Thr Leu Arg Gly Gin 
1570 1575 1580 

Ala Thr Ala Gly Gly Glu Val Pro His Leu Leu Arg Gly Leu Val Arg 
1585 1590 1595 1600 

Ala Ser Arg Arg Val Thr Arg Thr Ala Ala Ala Ser Gly Gly Gly Gly 
1605 1610 1615 

Leu Val His Lys Leu Ala Gly Arg Pro Ala Glu Glu Gin Glu Ala Val 
1620 1625 1630 

Leu Leu Gly He Val Gin Ala Glu Ala Ala Ala Val Leu Gly Phe Asn 
1635 1640 1645 

Ala Pro Glu Leu Ala Gin Gly Thr Arg Gly Phe Ser Asp Leu Gly Phe 
1650 1655 1660 

Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Ser Ala Ala Thr 
1665 1670 1675 1680 

Gly Val Lys Leu Pro Ala Thr Leu Val Phe Asp Tyr Pro Thr Pro Val 
1685 1690 1695 

Ala Leu Ala Axg His Leu Arg Glu Glu Leu Gly Glu Thr Val Ala Gly 
1700 1705 1710 

Ala Pro Ala Thr Pro Val Thr Thr Val Ala Asp Ala Gly Glu Pro He 
1715 1720 1725 



Ala He Val Gly Met Ala Cys Arg Leu Pro Gly Gly Val Met Ser Pro 
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1730 1735 1740 

Asp Asp Leu Trp Arg Met Val Ala Glu Gly Arg Asp Gly Met Ser Pro 
1745 1750 1755 1760 

Phe Pro Gly Asp Arg Gly Trp Asp Leu Asp Gly Leu Phe Asp Ser Asp 
1765 1770 1775 

Pro Glu Arg Pro Gly Thr Ala Tyr lie Arg Gin Gly Gly Phe Leu His 
1780 1785 1790 

Glu Ala Ala Leu Phe Asp Pro Gly Phe Phe Gly He Ser Pro Arg Glu 
1795 1800 1805 

Ala Leu Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser Trp 
1810 1815 1820 

Glu Ala Leu Glu Arg Ala Gly He Asp Pro Thr Lys Ala Arg Gly Asp 
1825 1830 1835 1840 

Ala Val Gly Val Phe Ser Gly Val Ser He His Asp Tyr Leu Glu Ser 
1845 1850 1855 

Leu Ser Asn Met Pro Ala Glu Leu Glu Gly Phe Val Thr Thr Ala Thr 
1860 1865 1870 

Ala Gly Ser Val Ala Ser Gly Arg Val Ser Tyr Thr Phe Gly Phe Glu 
1875 1880 1885 

Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
1890 1895 1900 

lie His Leu Ala Ala Gin Ala Leu Arg Gin Gly Glu Cys Thr Met Ala 
1905 1910 1915 1920 



Leu Ala Gly Gly Val Ala Val Met Gly Ser Pro He Gly Val He Gly 
1925 1930 1935 
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Met Ser Arg Gin Arg Gly Met Ala Glu Asp Gly Arg Val Lys Ala Phe 
1940 1945 1950 

Ala Asp Gly Ala Asp Gly Thr Val Leu Ser Glu Gly Val Gly lie Val 
1955 1960 1965 

Val Leu Glu Arg Leu Ser Val Ala Arg Glu Arg Gly His Arg Val Leu 
1970 1975 1980 

Ala Val Leu Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
1985 1990 1995 2000 



3 Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val He Arg Ser Ala 

111 



2005 2010 2015 



^- Leu Ala Gly Ala Gly Leu Gin Pro Ser Glu Val Asp Val Val Glu Ala 

m 

I' 2020 2025 2030 

□ 

yi 2035 2040 2045 

B 



His Gly Thr Gly Thr Ala Leu Gly Glu Pro He Glu Ala Gin Ala Leu 



Leu Ala Thr Tyr Gly Lys Ser Arg Glu Thr Pro Leu Trp Leu Gly Ser 
2050 2055 2060 

Leu Lys Ser Asn He Gly His Thr Gin Ala AJLa Ala Gly Val AJLa Ala 
2065 2070 2075 2080 

Val He Lys Met Val Gin Ala Leu Arg Gin Asp Thr Leu Pro Pro Thr 
2085 2090 2095 

Leu His Val Gin Glu Pro Thr Lys Gin Val Asp Trp Ser Ala Gly Ala 
2100 2105 2110 



Val Glu Leu Leu Thr Glu Gly Arg Glu Trp Ala Arg Asn Gly His Pro 
2115 2120 2125 
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Arg Arg Ala Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His 
2130 2135 2140 

Leu He Leu Glu Glu Ala Pro Ala Asp Asp Thr Ala Glu Ala Asp Val 
2145 2150 2155 2160 

Pro Asp Ala Val Val Pro Val Val He Ser Ala Arg Ser Thr Gly Ser 
2165 2170 2175 

Leu Ala Gly Gin Ala Gly Arg Leu Ala Ala Phe Leu Asp Gly Asp Val 
2180 2185 2190 

Pro Leu Thr Arg Val Ala Gly Ala Leu Leu Ser Thr Arg Ala Thr Leu 
2195 2200 2205 

Thr Asp Arg Ala Val Val Val Ala Gly Ser Ala Glu Glu Ala Arg Ala 
2210 2215 2220 

Gly Leu Thr Ala Leu Ala Arg Gly Glu Ser Ala Ser Gly Leu Val Thr 
2225 2230 2235 2240 

Gly Thr Ala Gly Met Pro Gly Lys Thr Val Trp Val Phe Pro Gly Gin 
2245 2250 2255 

Gly Thr Gin Trp Ala Gly Met Gly Arg Glu Leu Leu Glu Ala Ser Pro 
2260 2265 2270 

Val Phe Ala Glu Arg He Glu Glu Cys Ala Ala Ala Leu Gin Pro Trp 
2275 2280 2285 

He Asp Trp Ser Leu Leu Asp Val Leu Arg Gly Glu Gly Glu Leu Asp 
2290 2295 2300 

Arg Val Asp Val Leu Gin Pro Ala Cys Phe Ala Val Met Val Gly Leu 
2305 2310 2315 2320 

Ala Ala Val Trp Ala Ser Val Gly Val Val Pro Asp Ala Val Leu Gly 



2325 2330 2335 

His Ser Gin Gly Glu He Ala Ala Ala Cys Val Ser Gly Ala Leu Ser 
2340 2345 2350 

Leu Glu Asp Ala Ala Lys Val Val Ala Leu Arg Ser Gin Ala He Ala 
2355 2360 2365 

Ala Glu Leu Ser Gly Arg Gly Gly Met Ala Ser He Gin Leu Ser His 
2370 2375 2380 

Asp Glu Val Ala Ala Arg Leu Ala Pro Trp Ala Gly Arg Val Glu He 
2385 2390 2395 2400 



S Ala Ala Val Asn Gly Pro Ala Ser Val Val He Ala Gly Asp Ala Glu 

m . 2405 2410 2415 

ii. 

V Ala Leu Thr Glu Ala Val Glu Val Leu Gly Gly Arg Arg Val Ala Val 

O 2420 2425 2430 

n 

•as* 



Asp Tyr Ala Ser His Thr Arg His Val Glu Asp He Gin Asp Thr Leu 



U 2435 2440 2445 



Ala Glu Thr Leu Ala Gly He Asp Ala Gin Ala Pro Val Val Pro Phe 
2450 2455 2460 

Tyr Ser Thr Val Ala Gly Glu Trp He Thr Asp Ala Gly Val Val Asp 
2465 2470 2475 2480 

Gly Glv Tyr Trp Tyr Arg Asn Leu Arg Asn Gin Val Gly Phe Gly Pro 
2485 2490 2495 

Ala Val Ala Glu Leu He Glu Gin Gly His Gly Val Phe Val Glu Val 
2500 2505 2510 



Ser Ala His Pro Val Leu Val Gin Pro He Ser Glu Leu Thr Asp Ala 
2515 2520 2525 
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Val Val Thr Gly Thr Leu Arg Arg Asp Asp Gly Gly Val Arg Arg Leu 
2530 2535 2540 

Leu Thr Ser Met Ala Glu Leu Phe Val Arg Gly Val Pro Val Asp Trp 
2545 2550 2555 2560 

Ala Thr Met Ala Pro Pro Ala Arg Val Glu Leu Pro Thr Tyr Ala Phe 
2565 2570 2575 

Asp His Gin His Phe Trp Leu Ser Pro Pro Ala Val Ala Asp Ala Pro 
2580 2585 2590 

Ala Leu Gly Leu Ala Gly Ala Asp His Pro Leu Leu Gly Ala Val Leu 
2595 2600 2605 

Pro Leu Pro Gin Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Val 
2610 2615 2620 

Arg Thr His Pro Trp Leu Ala Asp Gly Val Pro Ala Ala Ala Leu Val 
2625 2630 2635 2640 

Glu Leu Ala Val Arg Ala Gly Asp Glu Ala Gly Cys Pro Val Leu Ala 
2645 2650 2655 

Asp Leu Thr Val Glu Lys Leu Leu Val Leu Pro Glu Ser Gly Gly Leu 
2660 2665 2670 

Arg Val Gin Val He Val Ser Gly Glu Arg Thr Val Glu Val Tyr Ser 
2675 2680 2685 

Gin Leu Glu Gly Ala Glu Asp Trp He Arg Asn Ala Thr Gly His Leu 
2690 2695 2700 

Ser Ala Thr AJ.a Pro Ala His Glu Ala Phe Asp Phe Thr Ala Trp Pro 
2705 2710 2715 2720 
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Pro Ala Gly Ala Gin Gin Val Asp Gly Leu Trp Arg Arg Gly Asp Glu 
2725 2730 2735 

lie Phe Ala Glu Val Ala Leu Pro Glu Glu Leu Asp Ala Gly Ala Phe 
2740 2745 2750 

Gly He His Pro Phe Leu Leu Asp Ala Ala Val Gin Pro Val Leu Ala 
2755 2760 2765 

Asp Asp Glu Gin Pro Ala Glu Trp Arg Ser Leu Val Leu His Ala Ala 
2770 2775 2780 

Gly Ala Ser Ala Leu Arg Val Arg Leu Val Pro Gly Gly Ala Leu Gin 
2785 2790 2795 2800 

Ala Ala Asp Glu Thr Gly Gly Leu Val Leu Thr Ala Asp Ser Val Ala 
2805 2810 2815 

Gly Arg Glu Leu Ser Ala Gly Lys Thr Arg Ala Gly Ser Leu Tyr Arg 
2820 2825 2830 

Val Asp Trp Thr Glu Val Ser He Ala Asp Ser Ala Val Pro Ala Asn 
2835 2840 2845 

He Glu Val Val Glu Ala Phe Gly Glu Glu Pro Leu Glu Leu Thr Gly 
2850 2855 2860 

Arg Val Leu Glu Ala Val Gin Thr Trp Leu Val Thr Ala Ala Asp Asp 
2865 2870 2875 2880 

Ala Arg Leu Val Val Val Thr Arg Gly Ala Val Arg Glu Val Thr Asp 
2885 2890 2895 

Pro Ala Gly Ala Ala Val Trp Gly Leu Val Arg Ala Ala Gin Ala Glu 
2900 2905 2910 



Asn Pro Gly Arg He Phe Leu He Asp Thr Asp Gly Glu He Pro Ala 
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2915 2920 2925 

Leu Thr Gly Asp Glu Pro Glu He Ala Val Arg Gly Gly Lys Phe Phe 
2930 2935 2940 

Val Pro Arg He Thr Arg Ala Glu Pro Ser Gly Ala Ala Val Phe Arg 
2945 2950 2955 2960 

Pro Asp Gly Thr Val Leu He Ser Gly Ala Gly Ala Leu Gly Gly Leu 
2965 2970 2975 

Val Ala Arg Arg Leu Val Glu Arg His Gly Val Arg Lys Leu Val Leu 
2980 2985 2990 

Ala Ser Arg Arg Gly Arg Asp Ala Asp Gly Val Ala Asp Leu Val Ala 
2995 3000 3005 

Asp Leu Ala Ala Asp Val Ser Val Val Ala Cys Asp Val Ser Asp Arg 
3010 3015 3020 

Ala Gin Val Ala Ala Leu Leu Asp Glu His Arg Pro Thr Ala Val Val 
3025 3030 3035 3040 

His Thr Ala Gly Val He Asp Ala Gly Val He Glu Thr Leu Asp Arg 
3045 3050 3055 

Asp Arg Leu Ala Thj: Val Phe Ala Pro Lys Val Asp Ala Val Arg His 
5060 3065 3070 



Leu Asp Glu Leu Thr Arg Asp Arg Asp Leu Asp Ala Phe Val Val Tyr 
3075 3080 3085 

Ser Ser Val Ser Ala Val Phe Met Gly Ala Gly Ser Gly Ser Tyr Ala 
3090 3095 3100 

Ala Ala Asn Ala Phe Leu Asp Gly Leu Met Ala Asn Arg Arg Ala Ala 
3105 3110 3115 3120 
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Gly Leu Pro Gly Leu Ser Leu Ala Trp Gly Leu Trp Asp Gin Ser Thr 
3125 3130 3135 

Gly Met Ala Ala Gly Thr Asp Glu Ala Thr Arg Ala Arg Met Ser Arg 
3140 3145 3150 

Arg Gly Gly Leu Gin He Met Thr Gin Ala Glu Gly Met Asp Leu Phe 
3155 3160 3165 

Asp Ala Ala Leu Ser Ser Ala Glu Ser Leu Leu Val Pro Ala Lys Leu 
3170 3175 3180 

Asp Leu Arg Gly Val Arg Ala Asp Ala Ala Ala Gly Gly Val Val Pro 
3185 3190 3195 3200 

His Met Leu Arg Gly Leu Val Arg Ala Gly Arg Ala Gin Ala Arg Ala 
3205 3210 3215 

Ala Ser Thr Val Asp Asn Gly Leu Ala Gly Arg Leu Ala Gly Leu Ala 
3220 3225 3230 

Pro Ala Asp Gin Leu Thr Leu Leu Leu Asp Leu Val Arg Ala Gin Val 
3235 3240 3245 

Ala Ala Val Leu Gly His Ala Asp Ala Ser AJLa Val Arg Val Asp Thr 
3250 3255 3260 

Ala Phe Lys Asp Ala Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg 
3265 3270 3275 3280 

Asn Arg Met Arg Thr Ala Thr Gly Leu Lys Leu Pro Ala Thr Leu Val 
3285 3290 3295 



Phe Asp Tyr Pro Asn Pro Gin Ala Leu Ala Arg His Leu Arg Asp Glu 
3300 3305 3310 
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Leu Gly Gly Ala Ala Gin Thr Pro Val Thr Thr Ala Ala Ala Lys Ala 
3315 3320 3325 

Asp Leu Asp Glu Pro He Ala He Val Gly Met Ala Cys Arg Leu Pro 
3330 3335 3340 

Gly Gly Val Ala Gly Pro Glu Asp Leu Trp Arg Leu Val Ala Glu Gly 
3345 3350 3355 3360 

Arg Asp Ala Val Ser Ser Phe Pro Thr Asp Arg Gly Trp Asp Thr Asp 
3365 3370 3375 

Ser Leu Tyr Asp Pro Asp Pro Ala Arg Pro Gly Lys Thr Tyr Thr Arg 
3380 3385 3390 

Kis Gly Gly Phe Leu His Glu Ala Gly Leu Phe Asp Ala Gly Phe Phe 
3395 3400 3405 

Gly He Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 
3410 3415 3420 

Leu Leu Glu Ala Ser Trp Glu Ala Met Glu Asp Ala Gly Val Asp Pro 
3425 3430 3435 3440 

Leu Ser Leu Lys Gly Asn Asp Val Gly Val Phe Thr Gly Met Phe Gly 
3445 3450 3455 

Gin Gly Tyr Val Ala Pro Gly Asp Ser Val Val Thr Pro Glu Leu Glu 
3460 3465 3470 

Gly Phe Ala Gly Thr Gly Gly Ser Ser Ser Val Ala Ser Gly Arg Val 
3475 3480 3485 

Ser Tyr Val Phe Gly Phe Glu Gly Pro Ala Val Thr He Asp Ser Ala 
3490 3495 3500 

Cys Ser Ser Ser Leu Val Ala Met His Leu Ala Ala Gin Ser Leu Arg 
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3505 3510 3515 3520 

Gin Gly Glu Cys Ser Met Ma Leu Ala Gly Gly Ala Thr Val Met Ala 
3525 3530 3535 

Asn Pre Gly Ala Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Val 
3540 3545 3550 

Asp Gly Arg Cys Lys Ala Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp 
3555 3560 3565 

Ala Glu Gly Val Gly Val Val He Leu Glu Arg Leu Ser Val Ala Aj:g 
3570 3575 3580 

Glu Arg Gly His Arg He Leu Ala Val Leu Arg Gly Ser Ala Val Asn 
3585 3590 3595 3600 

Gin Asp Gly AJLa Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin 
3605 3610 3615 

Gin Arg Val He Arg Arg Ala Leu Val Ser Ala Gly Leu Ala Pro Ser 
3620 3625 3630 

Asp Val A^p Val Val Glu Ala His Gly Thr Gly Thr Thr Leu Gly Asp 
3635 3640 3645 

Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Lys Asp Arg Glu 
3650 3655 3660 

Ser Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin 
3665 3670 3675 3680 

Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Gin Ala Leu Arg 
3685 3690 3695 

His Glu Val Leu Pro Pro Thr Leu His Val Asp Arg Pro Thr Pro Glu 
3700 3705 3710 
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Val Asp Trp Ser Ala Gly Ala Val Glu Leu Leu Thr Glu Ala Arg Glu 
3715 3720 3725 

Trp Pro Arg Asn Gly Arg Pro Arg Arg Ala Gly Val Ser Ala Phe Gly 
3730 3735 3740 

Val Ser Gly Thr Asn Ala His Leu lie Leu Glu Glu Ala Pro Ala Glu 
3745 3750 3755 3760 

Glu Pro Val Pro Thr Pro Glu Val Pro Leu Val Pro Val Val Val Ser 
3765 3770 3775 

Ala Arg Ser Arg Ala Ser Leu Ala Gly Gin Ala Gly Arg Leu Ala Gly 
3780 3785 3790 

Phe Val Ala Gly Asp Ala Ser Leu Ala Gly Val Ala Arg Ala Leu Val 
3795 3800 3805 

Thr Asn Arg Ala Ala Leu Thr Glu Arg Ala Val Met Val Val Gly Ser 
3810 3815 3820 

Arg Glu Glu Ala Val Thr Asn L-eu Glu Ala Leu Ala Arg Gly Glu Asp 
3825 3830 3835 3840 

Pro Ala Ala Val Val Thr Gly Arg Ala Gly Ser Pro Gly Lys Leu Val 
3845 3850 3855 

Trp val Phe Pro Gly Gin Gly Ser Gin Trp He Gly Met Gly Arg Glu 
3860 3865 3870 

Leu Leu Asp Ser Ser Pro Val Phe Ala Glu Arg Val Ala Glu Cys Ala 
3875 3830 3885 



Ala Ala Leu Glu Pro Trp He Asp Trp Ser Leu Leu Asp Val Leu Arg 
3890 3895 3900 
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Gly Glu Ser Asp Leu Leu Asp Arg Val A^p Val Val Gin Pro Ala Ser 
3905 3910 3915 3920 

Phe Ala Met Met Val Gly Leu Ala Ala Val Trp Gin Ser Val Gly Val 
3925 3930 3935 

Arg Pro Asp Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala 
3940 5945 3950 

Cys Val Ser Gly Ala Leu Ser Leu Gin Asp Ala Ala Lys Val Val Ala 
3955 3960 3965 

Leu Ajrg Ser Gin Ala He Ala Thr A-rg Leu Ala Gly Arg Gly Gly Met 
3970 3975 3980 

Ala Ser Val Ala Leu Ser Glu Glu Asp Ala Thr Ala Trp Leu Ala Pro 
3985 3990 3995 4000 

Trp AJ.a A^p Arg Val Gin Val Ala Ala Val Asn Ser Pro Ala Ser Val 
4005 4010 4015 

Val He Ala Gly Glu Ala Gin Ala Leu Asp Glu Val Val Asp Ala Leu 
4020 4025 4030 

Ser Gly Gin Glu Val Axg Val A-rg Ajrg Val Ala Val Asp Tyr Gly Ser 
4035 4040 4045 

His Thr Ajsn Gin Val Glu Ala He Glu Asp Leu Leu Ala Glu Thr Leu 
4050 4055 4060 

Ala Gly He Glu Ala Gin Ala Pro Lys Val Pro Phe Tyr Ser Thr Leu 
4065 4070 4075 4080 

He Gly A-sp Trp He Arg Asp Ala Gly He Val Asp Gly Gly Tyr Trp 
4085 4090 4095 



Tyr Arg Asn Leu Arg Asn Gin Val Gly Phe Gly Pro Ala Val Ala Glu 



- 147- 



4100 4105 4110 

Leu Val Arg Gin Gly His Gly Val Phe Val Glu Val Ser Ala His Pro 
4115 4120 4125 

Val Leu Val Gin Pro Leu Ser Glu Leu Ser Asp Asp Ala Val Val Thr 
4130 4135 4140 

Gly Ser Leu Arg Aj:g Glu Asp Gly Gly Leu Arg Arg Leu Leu Thr Ser 
4145 4150 4155 4160 

Met Ala Glu Leu Tyr Val Gin Gly Val Pro Leu Asp Trp Thr Ala Val 
4165 4170 4175 

Leu Pro Arg Thr Gly Arg Val Asp Leu Pro Lys Tyr Ala Phe Asp His 
4180 4185 4190 

Arg His Tyr Trp Leu Arg Pro Ala Glu Ser Ala Thr Asp Ala Ala Ser 
4195 4200 4205 

Leu Gly Gin Ala Ala Ala Asp His Pro Leu Leu Gly Ala Val Val Glu 
4210 4215 4220 

Leu Pro Gin Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Val Arg 
4225 4230 4235 4240 

Thr His Pro Trp Leu Ala Asp His Ala Val Gly Gly Val Val He Leu 
4245 4250 4255 

Pro Gly Ser Gly Leu Ala Glu Leu Ala Val Arg Ala Gly Asp Glu Ala 
4260 4265 4270 

Gly Cys Thr Ala Leu Asp Glu Leu He He Glu Ala Pro Leu Val Val 
4275 4280 4285 



Pro Ala Gin Gly Ala Val Arg Val Gin Val Ala Leu Ser Gly Pro Asp 
4290 4295 4300 
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Glu Thr Gly Ser Arg Thr Val Asp Leu Tyr Ser Gin Arg Asp Gly Gly 
4305 4310 4315 4320 

Ala Gly Thr Trp Thr Arg His Ala Thr Gly Val Leu Ser Thr Ala Pro 
4325 4330 4335 

Ala Gin Glu Pro Glu Phe Asp Phe His Ala Trp Pro Pro Ala Asp Ala 
4340 4345 4350 

Glu Arg He Asp Val Glu Thr Phe Tyr Thr Asp Leu Ala Glu Arg Gly 
4355 4360 4365 

Tyr Gly Tyr Gly Pro Ala Phe Gin Gly Leu Gin Ala Val Trp Arg Arg 
4370 4375 4380 

Asp Gly Asp Val Phe Ala Glu Val Ala Leu Pro Glu Asp Leu Arg Lys 
4385 4390 4395 4400 

Asp Ala Gly Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu 
4405 4410 4415 

Gin Ala Ala Thr Ala Val Gly Gly Asp Glu Pro Gly Gin Pro Val Leu 
4420 4425 4430 

Ala Phe Ala Trp Asn Gly Leu Val Leu His Ala AJ.a Gly Ala Ser Ala 
4435 4440 4445 

Leu Arg Val Arg Leu Ala Pro Ser Gly Pro Asp Thr Leu Ser Val Ala 
4450 4455 4460 

Ala Ala Asp Glu Thr Gly Gly Leu Val Leu Thr Met Glu Ser Leu Val 
4465 4470 4475 . 4480 

Ser Arg Pro Val Ser Ala Glu Gin Leu Gly Ala Ala Ala Asp Ala Gly 
4485 4490 4495 
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His Asp Ala Met Phe Arg Val Asp Trp Thr Glu Leu Pro Ala Val Pro 
4500 4505 4510 

Arg Ala Glu Leu Pro Pro Trp Val Arg He Asp Thr Ala Asp Asp Val 
4515 4520 4525 

Ala Ala Leu Ala Glu Lys Ala A^p Ala Pro Pro Val Val Val Trp Glu 
4530 4535 4540 

Ala Ala Gly Gly Asp Pro AJLa Leu Ala Val Ser Ser Arg Val Leu Glu 
4545 4550 4555 4560 

He Met Gin Ala Trp Leu Ala Ala Pro Ala Phe Glu Glu Ala Arg Leu 
4565 4570 4575 

Val Val Thr Thr Arg Gly Ala Val Pro Ala Gly Gly Asp His Thr Leu 
4580 4585 4590 

Thr Asp Pro Ala Ala Ala Ala Val Trp Gly Leu Val Arg Ser Ala Gin 
4595 4600 4605 

Ala Glu His Pro Asp Arg Val Val Leu Leu Asp Thr Asp Gly Glu Val 
4610 4615 4620 

Pro Leu Gly Ala Val Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg 
4625 4630 4635 4640 

Gly Thr Thr Phe Phe Val Pro Arg Leu Ala Arg Ala Thr Arg Leu Ser 
4645 4650 4655 

Asp Ala Pro Pro Ala Phe Asp Pro Asp Gly Thr Val Leu Val Ser Gly 
4660 4665 4670 

Ala Gly Ser Leu Gly Thr Leu Val Ala Arg His Leu Val Thr Arg His 
4675 4680 4685 



Gly Val Arg Arg Val Val Leu Ala Ser Arg Gin Gly Arg Asp Ala Glu 
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4690 4695 4700 



Gly Ala Gin Asp Leu lie Thr Glu Leu Thr Gly Glu Gly Ala Asp Val 
4705 4710 4715 4720 

Ser Phe Val Ala Cys Asp Val Ser Asp Arg Asp Gin Val Ala Ala Leu 
4725 4730 4735 

Leu Ala Gly Leu Pro Asp Leu Thr Gly Val Val His Thr Ala Gly Val 
4740 4745 4750 

Phe Glu Asp Gly Val lie Glu Ala Leu Thj: Pro Asp Gin Leu Ala Asn 
4755 4760 4765 

Val Tyr Ala Ala Lys Val Thr Ala hla Met His Leu Asp Glu Leu Thr 
4770 4775 4780 

Arg Asp Arg Asp Leu Gly Ala Phe Val Val Phe Ser Ser Val Ala Gly 
4785 4790 4795 4800 

Val Met Gly Gly Gly Gly Gin Gly Pro Tyr Ala Ala Ala Asn Ala Phe 
4805 4810 4815 

Leu Asp Ala Ala Met Ala Ser Arg Gin Ala Ala Gly Leu Pro Gly Leu 
4820 4825 4830 

Ser Leu Ala Trp Gly Leu Trp Glu Arg Ser Ser Gly Met Ala Ala His 
4835 4840 4845 

Leu Ser Glu Val Asp His Ala Arg Ala Ser Arg Asn Gly Val Leu Glu 
4850 4855 4860 

Leu Thr Arg Ala Glu Gly Leu AJLa Leu Phe Asp Leu Gly Leu Arg Met 
4865 4870 4875 4880 



SI 
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Ala Glu Ser Leu Leu Val Pro lie Lys Leu Asp Leu Ala Ala Met Arg 
4885 4890 4895 
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Ala Ser Thr Val Pro Val Leu Phe Arg Gly Leu Val Arg Pro Ser Arg 
4900 4905 4910 

Thr Gin Ala Arg Thr Ala Ser Thr Val Asp Arg Gly Leu Ala Gly Arg 
4915 4920 4925 

Leu Ala Gly Leu Pro Val Ala Glu Arg Ala Ala Val Leu Val Asp Leu 
4930 4935 4940 

Val Arg Gly Gin Val Ala Val Val Leu Gly Ty^: Asp Gly Pro Glu Ala 
4945 4950 4955 4960 

Val Arg Pro Asp Thr Ala Phe Lys Asp Thr Gly Phe Asp Ser Leu Thr 
4965 4970 4975 

Ser Val Glu Leu Arg Asn Aj:g Leu Arg Glu Ala Thr Gly Leu Lys Leu 
4980 4985 4990 

Pro Ala Thr Leu Val Phe Asp Tyr Pro Asn Pro Leu Ala Val Ala Arg 
4995 5000 5005 

Tyr Leu Gly Ala Arg Leu Val Pro Asp Gly Thr Ala Asn Gly Asn Gly 
5010 5015 5020 

Asn Gly Asn Gly His Ser Glu Asp Asp Arg Leu Arg His Ala Leu Ala 
5025 5030 5035 5040 

Ala He Ala Ala Glu Asp Ala Gly Glu Glu Arg Ser He Ala Asp Leu 
5045 5050 5055 

Gly Val Asp Asp Leu Val Gin Leu Ala Phe Gly Asp Glu 
5060 5065' 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1721 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 6: 

Met Ala Cys Arg Leu Pro Gly Gly Val Thr Gly Pro Gly Asp Leu Trp 
1 5 10 15 

Arg Leu Val Ala Glu Gly Gly Asp Ala Val Ser Gly Phe Pro Thr Asp 
20 25 30 

Arg Cys Trp Asp Leu Asp Thr Leu Phe Asp Pro Asp Pro Asp His Ala 
35 40 45 

Gly Thr Ser Tyr Thr Asp Gin Gly Gly Phe Leu His Asp Ala Ala Leu 
50 55 60 

Phe A^p Pro Gly Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met 
• 65 70 75 80 

Asp Pro Gin Gin Arg Leu Leu Leu Glu AJ.a Ser Trp Glu Ala Leu Glu 
85 90 95 

Gly Val Gly Leu Asp Pro Ala Ser Leu Gin Gly Thr Asp Val Gly Val 
100 105 110 

Phe Thr Gly Ala Gly Gly Ser Gly Tyr Gly Gly Gly Leu Thr Gly Pro 
115 120 125 . 

Glu Met Gin Ser Phe Ala Gly Thr Gly Leu Ala Ser Ser Val Ala Ser 
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130 135 140 

Gly Arg Val Ser Tyr Val Phe Gly Phe Glu Gly Pro Ala Val Thr He 
145 150 155 160 

Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Met His Leu Ala Ala Gin 
165 170 175 

Ala Leu Arg Gin Gly Asp Cys Ser Met Ala Leu Ala Gly Gly Ala Met 
180 185 190 

Val Met Ser Gly Pro Asp Ser Phe Val Val Phe Ser Arg Gin Arg Gly 
195 200 205 

Leu Ala Thr Asp Gly Arg Cys Lys Ala Phe Ala Ser Gly Ala Asp Gly 
210 215 220 

Met Val Leu Ala Glu Gly He Ser Val Val Val Leu Glu Arg Leu Ser 
225 230 235 240 

Val Ala Arg Glu Arg Gly His Arg Val Leu Ala Val Leu Arg Gly Ser 
245 250 255 

Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly 
260 265 270 

Pro Ser Gin Gin Arg Val He Arg Ala Ala Leu Ala Asn Ala Gly He 
275 280 285 

Gly Pro Ser Asp Val Asp Leu Val Glu Ala His Gly Thr Gly Thr Ser 
290 295 300 

Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Gin 
305 310 315 320 



Asp Arg Glu Thr Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly 
325 330 335 



- 154- 



His Thr Gin Ala Ala Ala Gly Val Ala Ser Val He Lys Val Val Gin 
340 345 350 

Ala Leu Arg His Gly Val Met Pro Pro Thr Leu His Val Asp Glu Pro 
355 360 365 

Ser Ser Gin Val Asp Trp Ser Glu Gly Ala Val Glu Leu Leu Thr Gly 
370 375 380 

Ser Arg Asp Trp Pro Arg Gly Asp Arg Pro Arg Arg Ala Gly Val Ser 
335 390 395 400 

Ser Phe Gly Val Ser Gly Thr Asn Val His Leu He He Glu Glu Ala 
405 410 415 

Pro Glu Glu Pro Ala Ala Ala Val Pro Thr Ser Ala Asp Val Val Pro 
420 425 430 

Leu Val Val Ser Ala Arg Ser Thr Gly Ser Leu Ala Gly Gin Ala Asp 
435 440 445 

Arg Leu Thr Glu Val Asp Val Pro Leu Gly His Leu Ala Gly Ala Leu 
450 455 460 

Val Ala Gly Arg Ala Val Leu Glu Glu Arg Ala Val Val Val Ala Gly 
465 470 475 480 

Ser Ala Glu Glu Ala Arg Ala Gly Leu Gly Ala Leu Ala Arg Gly Glu 
485 490 495 



Ala Ala Pro Gly Val Val Thr Gly Thr Ala Gly Lys Pro Gly Lys Val 
500 505 510 



Val Trp Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Arg 
515 520 525 
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Glu Leu Leu Asp Ala Ser Pro Val Phe Ala Glu Arg He Lys Glu Cys 
530 535 540 

Ala Ala Ala Leu Asp Gin Trp Thr Asp Trp Ser Leu Leu Asp Val Leu 
545 550 555 560 

Arg Gly Asp Gly Asp Leu Asp Ser Val Glu Val Leu Gin Pro Ala Cys 
565 570 575 

Phe AJ.a Val Met Val Gly Leu Ala Ala Val Trp Glu Ser Ala Gly Val 
580 585 590 

Arg Pro Asp Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala 
595 600 605 

Cys Val Ser Gly Ala Leu Thr Leu Asp Asp Ala Ala Lys Val Val Ala 
610 615 620 

Leu Arg Ser Gin Ala He Ala Ala Arg Leu Ser Gly Arg Gly Gly Met 
625 630 635 640 

Ala Ser Val Ala Leu Ser. Glu Asp Glu Ala Asn Ala Arg Leu Gly Leu 
645 650 655 

Trp Asp Gly Arg He Glu Val Ala Ala Val Asn Gly Pro Ala Ser Val 
660 665 670 

Val He Ala Gly Asp Ala Gin Ala Leu Asp Glu Ala Leu Glu Val Leu 
675 680 685 

Ala Gly Asp Gly Val Arg Val Arg Gin Val Ala Val Asp Tyr Ala Ser 
690 695 700 



His Thr Arg His Val Glu Asp He Arg Asp Thr Leu Ala Glu Thr Leu 
705 710 715 720 



Ala Gly He Thr Ala Gin Ala Pro Asp Val Pro Phe Arg Ser Thr Val 
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725 



730 



735 



Thr Gly Gly Trp Val Arg Asp Ala Asp Val Leu Asp Gly Gly Tyr Trp 
740 745 750 

Tyr Arg Asn Leu Arg Asn Gin Val Arg Phe Gly Pro Ala Val Ala Glu 
755 760 765 

Leu Leu Glu Gin Gly His Gly Val Phe Val Glu Val Ser Ala His Pro 
770 775 780 

Val Leu Val Gin Pro He Ser Glu Leu Thr Asp Ala Val Val Thr Gly 
785 790 795 800 

Thr Leu Arg Arg Asp Asp Gly Gly Leu Arg Arg Leu Leu Thr Ser Met 
805 810 815 

Ala Glu Leu Phe Val Arg Gly Val Arg Val Asp Trp Ala Thj: Leu Val 
820 825 830 

Pro Pro Ala Arg Val Asp Leu Pro Thr Tyr Ala Phe Asp His Gin His 
835 840 845 

Phe Trp Leu Arg Pro Ala Ala Gin Ala Asp Ala Val Ser Leu Gly Gin 
850 855 860 

Ala Ala Ala. Glu His Pro Leu Leu Gly Ala Val Val Arg Leu Pro Gin 
865 870 875 880 

Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Leu Arg Thr His Pro 
885 890 895 



Trp Leu Ala Asp His Thr He Gly Gly Val Val Leu Phe Pro Gly Thr 
900 905 910 



Gly Leu Val Glu Leu Ala Val Arg Ala Gly Asp Glu Ala Gly Cys Pro 
915 920 925 
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Val Leu Asp Glu Leu Val Thr Glu Ala Pro Leu Val Val Pro Gly Gin 
930 935 940 

Gly Gly Val Asn Val Gin Val Thr Val Ser Gly Pro Asp Gin Asn Gly 
945 950 955 960 

Leu Arg Thr Val Asp He His Ser Gin Arg Asp Asp Val Trp Thr Arg 
965 970 975 

His Ala Thr Gly Thr Val Ser Ala Thr Pro Ala Ser Ser Pro Gly Phe 
980 985 990 

Ajsp Phe Thr Ala Trp Pro Pro Pro Asp Gly Gin Arg Val Glu He Gly 
995 1000 1005 

Asp Phe Tyr Ala Asp Leu Ala Glu Arg Gly Tyr Ala Tyr Gly Pro Leu 
1010 1015 1020 

Phe Gin Gly Val Arg Ala Val Trp Gin Arg Gly Glu Asp Val Phe Ala 
1025 1030 1035 1040 

Glu Val Ala Leu Pro Glu Asp Arg Arg Glu Asp Ala Ala Arg Phe Gly 
1045 1050 1055 

Leu His Pro Ala Leu Leu Asp Ala Ala Leu Gin Thr Gly Thr He Ala 
1060 1065 1070 



Ala Ala Ala Ser Gly Gin Pro Gly Lys Ser Val Met Pro Phe Ser Trp 
1075 1080 1085 

Asn Arg Leu Ala Leu His Ala Val Gly Ala Ala Gly Leu Arg Val Arg 
1090 1095 1100 

Val Ala Pro Gly Gly Pro Asp Ala Leu Thr Val Glu Ala Ala Asp Glu 
1105 1110 1115 1120 
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Thr Gly Ala Pro Val Leu Thr Met Asp Ser Leu lie Leu Arg Glu Val 
1125 1130 1135 

Ala Leu Asp Gin Leu Asp Thr Ala Arg Ala Gly Ser Leu Tyr Arg Val 
1140 1145 1150 

Asp Trp Thr Pro Leu Pro Thr Val Asp Ser Ala Val Pro Ala Gly Arg 
1155 1160 1165 

Ala Glu Val Leu Glu AJLa Phe Gly Glu Glu Pro Leu Asp Leu Thr Gly 
1170 1175 1180 

Arg Val Leu Ala Ala Leu Gin Ala Trp Leu Ser Asp Ala Ala Glu Glu 
1185 1190 1195 1200 

Ala Arg Leu Val Val Val Thr Arg Gly Ala Val Pro Ala Gly Asp Gly 
1205 1210 1215 

Val Val Ser Asp Pro Ala Gly Ala Ala Val Trp Gly Leu Val Arg Ala 
1220 1225 1230 

Ala Gin Ala Glu Asn Pro Asp Arg Phe Val Leu Leu Asp Thr Asp Gly 
1235 1240 1245 

Glu Val Pro Leu Glu Ala Val Leu Ala Thr Gly Glu Pro Gin Leu Ala 
1250 1255 1260 

Leu Arg Gly Thj: Thr Phe Ser Val Pro Arg Leu Ala Arg Val Thr Glu 
1265 1270 1275 1280 

Pro Ala Glu Ala Pro Leu Thr Phe Arg Pro Asp Gly Thr Val Leu Val 
1285 1290 1295 

Ser Gly Ala Gly Thr Leu Gly Ala Leu Ala Ala Arg Asp Leu Val Thr 
1300 1305 1310 

Arg His Gly Val Arg Arg Leu Val Leu Ala Ser Arg Arg Gly Arg Ala 
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1315 1320 1325 

Ala Glu Gly lie Asp Asp Leu Val Ala Glu Leu Thr Gly His Gly Ala 
1330 1335 1340 

Glu Val Thr Val Ala Ala Cys Asp Val Ser Asp Arg Asp Gin Val Ala 
1345 1350 1355 1360 

Ala Leu Leu Lys Glu His Ala Leu Thr Ala Val Val His Thr Ala Gly 
1365 1370 1375 

Val Phe Asp Ala Gly Val Thr Gly Ala Leu Thr Arg Glu Arg Leu Ala 
1380 1385 1390 

Lvs Val Phe Ala Pro Lys Val Asp Ala Ala Asn His Leu Asp Glu Leu 
1395 1400 1405 

Thr Arg Asp Leu Asp Leu Asp Ala Phe lie Val Tyr Ser Ser Ala Ser 
1410 1415 1420 

Ser lie Phe Met Gly Ala Gly Ser Gly Gly Tyr Ala Ala Ala Asn Ala 
1425 1430 1435 1440 

Tyr Leu Asp Gly Leu Met AJ.a Ala Arg Arg Ala Ala Gly Leu Pro Gly 
1445 1450 1455 

Leu Ser Leu Ala Trp Gly Pro Trp Glu Gin Leu Thr Gly Met Ala Asp 
1460 1465 1470 

Thr lie Asp Asp Leu Tb-r Leu Ala Arg Met Ser Arg Arg Glu Gly Arg 
1475 1480 1485 

Gly Gly Val Arg Ala Leu Gly Ser Ala Asp Gly Met Glu Leu Phe Asp 
1490 1495 1500 



Ala Ala Leu Ala AJ.a Gly Gin Ala Leu Leu Val Pro He Glu Leu Asp 
1505 1510 1515 1520 
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Leu Arg Glu Val Arg Ala Asp Ala Ala Gly Gly Gly Thr Val Pro His 
1525 1530 1535 

Leu Leu Arg Gly Leu Val Arg Ala Gly Arg Gin Ala Ala Arg Thr Ala 
1540 1545 1550 

Ala Thr Glu Asp Gly Gly Leu Glu Arg Arg Leu Ala Gly Leu Thr Val 
1555 1560 1565 

Ala Glu Gin Glu Ala Leu Leu Leu Asp Leu Val Arg Gly Gin Val Ala 
1570 1575 1580 

Val Val Leu Gly His Ala Asp Ser Ser Gly Val Arg Ala Asp Ala Ala 
1585 1590 1595 1600 

Phe Lys Asp Ala Gly Phe Asp Ser Leu Thr Ser Val Glu Leu Arg Asn 
1605 1610 1615 

Arg Leu Arg Glu Thr Thr Gly Leu Lys Leu Pro Ala Thr Leu Val Phe 
1620 1625 1630 



Asp His Pro Asn Pro Leu Ala Leu Ala Arg His Leu Arg Ala Glu Leu 
1635 1640 1645 

Ala Val Asp Glu Ala Ser Pro Ala Asp Ala Val Leu Ala Gly Leu Ala 
1650 .1655 1660 

Gly Leu Glu Ala Ala lie Ala Ala Ala Gly Ala Pro Asp Gly Asp Arg 
1665 1670 1675 1680 

lie Thr Ala Arg Leu Arg Glu Leu Leu Lys Ala Ala Glu Ala Ala Glu 
1685 1690 1695 

Ala Arg Pro Gly Thr Ser Gly Asp Leu Asp Thr Ala Ser Asp Glu Glu 
1700 1705 1710 
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Leu Phe Ala Leu Val Asp Gly Leu Asp 
1715 1720 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1688 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ala Cys Ajcg Tyr Pro Gly Gly Val Ser Ser Pro Glu Asp Leu Trp 
15 10 15 

Arg Leu Val AJ.a Glu Gly Thr Asp Ala Val Ser Ala Phe Pro Gly Asp 
20 25 30 

Arg Gly Trp Asp Val Asp Gly Leu Val Asp Pro Asp Pro Asp Arg Pro 
35 40 45 

Gly Thr Thr Tyr Thr Asp Gin Gly Gly Phe Leu His Glu Ala Gly Leu 
50 55 60 

Phe Asp Ala Gly Phe Phe Gly lie Ser Pro Arg Glu Ala Val Ala Met 
65 70 75 80 

Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He Glu 
85 90 95 

Arg Thr Gly Thr Asp Pro Leu Ser Leu Lys Gly Ser Asp He Gly Val 



100 



105 



110 



Phe Thr Gly Val Ala Ser Met Gly Tyr Gly Ala Gly Gly Gly Val Val 
115 120 125 



Ala Pro Glu Leu Glu Gly Phe Val Gly Thr Gly Ala Ala Pro Cys lie 
130 135 140 

Ala Ser Gly Arg Val Ser Tyr Val Leu Gly Phe Glu Gly Pro Ala Val 
145 150 155 160 

Thr Val Asp Thr Gly Cys Ser Ser Ser Leu Val Ala Met His Leu Ala 
165 170 175 

Ala Gin Ala Leu Arg A-rg Gly Glu Cys Ser Met Ala Leu Ala Gly Gly 
180 185 190 

Ala Met Val Met Ala Gin Pro Gly Ser Phe Val Ser Phe Ser Arg Gin 
155 200 205 

Arg Gly Leu Ala Leu A^p Gly Axg Cys Lys Ala Phe Ser Asp Ser Ala 
210 215 220 

Asp Gly Met Gly Leu Ala Glu Gly Val Gly Val He Ala Leu Glu Ajrg 
225 230 235 240 

Leu Ser Val Ala Arg Glu Axg Gly His Arg Val Leu Ala Val Leu Arg 
245 250 255 

Gly He Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
260 265 270 

Asn Gly Pro Ser Gin Gin Arg Val He Arg Ala Ala Leu Ala Glu Ala 
275 280 285 



Gly Leu Ser Pro Ser Asp Val A^p Ala Val Glu Gly His Gly Thr Gly 
290 295 300 
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Thr Thr Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Ala Thr Tyr 
305 310 315 320 

Gly Lys Gly Arg Asp Pro Glu Lys Pro Leu Trp Leu Gly Ser Val Lys 
325 330 335 

Ser Asn Leu Gly His Thr Gin Ala Ala Ala Gly Val Ala Ser Val He 
340 345 350 

Lys Met Val Gin Ala Leu Arg His Gly Val Leu Pro Pro Thr Leu His 
355 360 365 

Val Asp Arg Pro Ser Thr Glu Val Asp Trp Ser Ala Gly Ala Val Ser 
370 375 380 

Leu Leu Thr Glu Ala Arg Glu Trp Pro Arg Glu Gly Arg Pro Arg Arg 
385 390 395 400 

Ala Gly Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Leu He 
405 410 415 

Leu Glu Glu Ala Pro Glu Glu Glu Pro Pro Val Ala Glu Ala Pro Ser 
420 425 430 

Ala Gly Val Val Pro Val Val Val Ser Ala Arg Gly Ala Leu Ala Gly 
435 440 445 

Gin Ala Gly Arg Leu Ala Ala Phe Leu Glu Ala Ser Asp Glu Pro Leu 
450 455 460 

Val Thr Val Ala Gly Ala Leu He Cys Gly Arg Ser Arg Phe Gly Asp 
465 470 475 480 



Arg Ala Val Val Val Ala Gly Thr Arg Ala Glu Ala Thr Ala Gly Leu 
485 490 495 
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Ala Ala Leu Ala Arg Gly Glu Ser Ala Ala Asp Val Val Thr Gly Thr 
500 505 510 

Val Ala AJ.a Ser Gly Val Pro Gly Lys Leu Val Trp Val Phe Pro Gly 
515 520 525 

Gin Gly Ser Gin Trp Val Gly Met Gly Arg Glu Leu Leu Glu Ala Ser 
530 535 540 

Pro Val Phe Ala Ala Arg He Ala Glu Cys Ala Ala Ala Leu Glu Pro 
545 550 555 560 

Trp He Asp Trp Ser Leu Leu Asp Val Leu Arg Gly Glu Gly Asp Leu 
565 570 575 

Asp Arg Val Asp Val Val Gin Pro Ala Ser Phe Ala Val Met Val Gly 
580 585 590 

Leu Ala Ala Val Trp Ser Ser Val Gly Val Val Pro Asp Ala Val Leu 
595 600 605 

Gly Kis Ser Gin Gly Glu He Ala Ala Ala Cys Val Ser Gly Ala Leu 
610 615 620 

Ser L-eu Gin Asp Ala Ala Lys Val Val Ala Leu Arg Ser Gin Ala He 
625 630 635 640 

Ala Ala Lys Leu Ala Gly Arg Gly Gly Met Ala Ser Val Ala Leu Ser 
645 650 655 

Glu Glu Asp Ala Val Ala Arg Leu Arg His Trp Ala Asp Arg Val Glu 
660 665 670 



Val Ala Ala Val Asn Ser Pro Ser Ser Val Val He Ala Gly Asp Ala 
675 680 685 



Glu Ala Leu Asp Gin Ala Leu Glu Ala Leu Thr Gly Gin Asp He Arg 
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690 695 700 

Val Arg Arg Val Ala Val Asp Tyr Ala Ser His Thr Arg His Val Glu 
705 710 715 720 

Asp He Gin Glu Pro Leu Ala Glu Ala Leu Ala Gly He Glu Ala His 
725 730 735 

Ala Pro Thr Leu Pro Phe Phe Ser Thr Leu Thr Gly Asp Trp He Arg 
740 745 750 

Glu Ala Gly Val Val Asp Gly Gly Tyr Trp Tyr Arg Asn Leu Arg A^n 
755 760 765 

Gin Val Gly Phe Gly Pro Ala Val Ala Glu Leu Leu Gly Leu Gly His 
770 775 780 

Arg Val Phe Val Glu Val Ser Ala His Pro Val Leu Val Gin Ala He 
795 790 795 800 

Ser AJ.a He Ala Asp Asp Thr Asp Ala Val Val Thr Gly Ser Leu Arg 
805 " 810 815 

Arg Glu Glu Gly Gly Leu Arg Arg Leu Leu Thr Ser Met Ala Glu Leu 
820 825 830 

Phe Val Arg Gly Val Asp Val Asp Trp Ala Thr Met Val Pro Pro Ala 
835 840 845 

Arg Val Asp Leu Pro Thr Tyr Ala Phe Asp His Gin His Tyr Trp Leu 
850 855 860 

Axg Tyr Val Glu Thr Ala Th-r Asp Ala Ala Gly Pro Val Val Arg Leu 
865 870 875 880 



Pro Gin Thr Gly Gly Leu Val Phe Thr Thr Glu Trp Ser Leu Lys Ser 
885 890 895 
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Gln Pro Trp Leu Ala Glu His Thr Leu Glu Asp Leu Val Val Val Pro 
900 905 910 

Gly Ala Ala Leu Val Glu Leu Ala Val Arg Ala Gly Asp Glu Ala Gly 
915 920 925 

Thr Pro Val Leu Asp Glu Leu Val He Glu Thr Pro Leu Val Val Pro 
930 935 940 

Glu Arg Gly Ala He Arg Val Gin Val Thr Val Ser Gly Pro Asp Asp 
945 950 955 960 

Gly Thr Arg Thr Leu Glu Val His Ser Gin Pro Glu Asp Ala Thr Asp 
965 970 975 

Glu Trp Thr Arg His Ala Thr Gly Thr Leu Ser Ala Thr Pro Asp Glu 
980 985 990 ' 

Ser Ser Gly Phe Asp Phe Thr Ala Trp Pro Pro Pro Gly Ala Arg Gin 
995 1000 1005 

Leu Asp Gly Val Pro Ala He Trp Arg Ala Gly Asp Glu He Phe Ala 
1010 1015 1020 

Glu Val Ser Leu Pro Asp Asp Ala Asp Ala Glu Ala Phe Gly He His 
1025 1030 1035 1040 

Pro Ala Leu Leu Asp Ala Ala Leu His Pro Ala Leu Pro Gly Asp Asp 
1045 1050 1055 

Gly Leu Thr Gin Pro Met Glu Trp Arg Gly Leu Thr Leu His Ala Ala 
1060 1065 1070 



Gly Ala Ser Thr Leu Arg Val Arg Leu Val Pro Gly Gly Phe Leu Glu 
1075 1080 1085 
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Ala Ala Asp Gly AJ.a Gly Ser Leu Val Val Thr Ala Lys Glu Val Ala 
1090 1095 1100 

Leu Arg Pro Val Thr He Ala Arg Ser Arg Thr Thr Thr Arg Asp Ser 
1105 1110 1115 1120 

Leu Phe Gin Leu Asn Trp He Glu Leu Pro Glu Ser Gly Val Val Ala 
1125 1130 1135 

Ala Ala Asp Asp Thr Glu Val Leu Glu Val Pro Ala Gly Asp Ser Pro 
1140 1145 1150 

Leu Ala Ala Thr Ser Ajrg Val Leu Glu Arg Leu Gin Thr Trp Leu Thr 
1155 1160 1165 

Glu Pro Glu AJLa Glu Gin Leu Val Val Val Thr Arg Gly Ala Val Pro 
1170 1175 1180 

Ala Gly Asp Thr Pro Val Thr Asp Pro Ala Ala Ala Ala Val Trp Gly 
1185 1190 1195 1200 

Leu Val Arg Ser Ala Gin Ala Glu Asn Pro Asp Arg He Val Leu Leu 
1205 1210 1215 

Asp Thr Asp Gly Glu Val Pro Leu Gly Ala Val Leu Ala Gly Gly Glu 
1220 1225 1230 

Pro Gin Val Ala Val Arg Gly Thr Ala Leu Tyr Val Pro Arg Leu Ala 
1235 1240 1245 

Arc Ala Asp Ala Ala Pro Val Ser Gly Leu His Gly Thr Val Leu Val 
1250 1255 1260 

Ser Gly Ala Gly Val Leu Gly Glu He Val Ala Arg His Leu Val Thr 
1265 1270 1275 1280 

Arg His Gly Val Arg Lys L^u Val Leu Ala Ser Arg Arg Gly Leu Asp 
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1285 1290 1295 

Ala Asp Gly Ala Lys Asp Leu Val Thr Asp Leu Thr Gly Glu Gly Ala 
1300 1305 1310 



Asp Val Ser Val Val AJLa Cys Asp Leu Ala Asp Arg Asn Gin Val Ala 
1315 1320 1325 

Ala Leu Leu Ala Asp His Arg Pro Ala Ser Val He His Thr Ala Gly 
1330 1335 1340 

Val Leu Asp Asp Gly Val He Gly Thr Leu Thr Pro Glu Arg Leu Ala 
1345 1350 1355 1360 

Lys Val Phe Ala Pro Lys Val Asp Ala Val Arg His Leu Asp Glu Leu 
1365 1370 1375 

Thr Arg Asp Leu Asp Leu Asp AJ.a Phe Val Val Phe Ser Ser Gly Ser 
1380 1385 1390 

Gly Val Phe Gly Ser Pro Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala 
1395 1400 1405 

Phe Leu Asp Ala Ala Met Ala Ser Arg Arg Ala Ala Gly Leu Pro Gly 
1410 1415 1420 

Leu Ser Leu AJLa Trp Gly Leu Trp Glu Gin Ala Thr Gly Met Thr Ala 
1425 1430 1435 1440 

His Leu Gly Gly Thr Asp Gin Ala Arg Met Ser Arg Gly Gly Val Arg 
1445 1450 1455 

Pro He Thr Ala Glu Glu Gly Met Ala Leu Phe Asp Thr Ala Leu Gly 
1460 1465 1470 



Ala Gin Pro Ala Leu Leu Val Pro Val Lys Leu Asp Leu Arg Glu Val 
1475 1480 1485 
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Arg Ala Gly Gly Ala Val Pro His Leu Leu Arg Gly Leu Val Arg Ala 
1490 1495 1500 

Gly Arg Arg Gin Ala Gin Ala Ala Ser Thr Val Asp Asn Gin Leu Leu 
1505 1510 1515 1520 

Gly Arg Leu Ala Gly Leu Gly Ala Pro Glu Gin Glu Ala Leu Leu Val 
1525 1530 1535 

Asp Leu Val Arg Gly Gin Val Ala Ala Val Leu Gly His Ala Gly Pro 
1540 1545 1550 

Asp Ala Val Arg Ala Asp Thr Ala Phe Lys Asp Ala Gly Phe Asp Ser 
1555 1560 1565 

Leu Thr Ser Val Asp Leu Arg Asn Arg Leu Arg Glu Ser Thr Gly Leu 
1570 1575 1580 

Lys Leu Pro Ala Thr Leu Ala Phe Asp Tyr Pro Thr Pro Leu Val Leu 
1535 1590 1595 1600 

Ala Arg His Leu Arg Asp Glu Leu Gly Ala Gly Asp Asp Ala Leu Ser 
1605 1610 1615 

Val Val His Ala Arg Leu Glu Asp Val Glu Ala Leu Leu Gly Gly Leu 
1620 1625 1630 

Arg Leu Asp Glu Ser Thr Lys Thr Gly Leu Thr Leu Arg Leu Gin Gly 
1635 1640 1645 

Leu Val Ala Arg Cys Asn Gly Val Asn Asp Gin Thr Gly Gly Glu Thr 
1650 1655 ' 1660 



Leu Ala Asp Arg Leu Glu Ala Ala Ser Ala Asp Glu Val Leu Asp Phe 
1665 1670 1675 1680 
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Ile Asp Glu Glu Leu Gly Leu Thr 
1685 

INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3413 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met ^ Ala Thr Asp Glu Lys Leu Leu Lys Tyr Leu Lys Arg Val Thr Ala 
15 10 15 

Glu Leu His Ser Leu Arg Lys Gin Gly Ala Arg. His Ala Asp Glu Pro 
20 25 30 

Leu Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Gly Val Ser Ser 
35 40 45 

Pro Glu Asp Leu Trp Gin Leu Val Ala Gly Gly Val Asp Ala Leu Ser 
50 55 60 

Asp Phe Pro Asp Asp A-rg Gly Trp Glu Leu Asp Gly Leu Phe Asp Pro 
65 70 75 80 

Asp Pro Asp His Pro Gly Thr Ser Tyr Thr Ser Gin Gly Gly Phe Leu 
85 90 95 

Pjcg Gly Ala Gly Leu Phe Asp Ala Gly Leu Phe Gly He Ser Pro Arg 
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100 105 110 

Glu Ala Leu Val Met Asp Pro Gin Gin Arg Val Leu Leu Glu Thr Ser 
115 120 125 

Trp Glu Ala Leu Glu Asp Ala Gly Val Asp Pro Leu Ser Leu Lys Gly 
130 135 140 

Ser Asp Val Gly Val Phe Ser Gly Val Phe Thr Gin Gly Tyr Gly Ala 
145 150 155 160 

Gly Ala He Thr Pro Asp Leu Glu Ala Phe Ala Gly He Gly Ala Ala 
165 170 175 

Ser Ser Val Ala Ser Gly Arg Val Ser Tyr Val Phe Gly Leu Glu Gly 
180 185 190 

Pro Ala Vai Thr He Asp Thr Ala Cys Ser Ser Ser Leu Val Ala He 
195 200 205 



His Leu Ala Ala Gin Ala Leu Arg Ala Gly Glu Cys Ser Met Ala Leu 
210 215 220 

Ala Gly Gly Ala Thr Val Met Pro Thr Pro Gly Thr Phe Val Ala Phe 
225 230 235 240 

Ser Arg Gin Arg Val Leu AJLa AJLa Asp Gly Arg Ser Lys Ala Phe Ser 
245 250 255 



Ser Thr AJ.a Asp Gly Thr Gly Trp Ala Glu Gly Ala Gly Val Leu Val 
260 265 270 

Leu Glu Arg Leu Ser Val Ala Gin Glu Arg Gly His Arg He Leu Ala 
275 280 285 



Val Leu Arg Gly Ser AJLa Val Asn Gin Asp Gly Ala Ser Asn Gly Leu 
290 295 300 
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Thr Ala Pro Asn Gly Pro Ser Gin Gin Arg Val He Arg Lys Ala Leu 
305 310 315 320 

Ala Gly Ala Gly Leu Val Ala Ser Asp Val Asp Val Val Glu Ala His 
325 330 335 

Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu 
340 345 350 

Ala Thr Tyr Gly Gin Gly Arg Glu Arg Pro Leu Trp Leu Gly Ser Val 
355 360 365 

Lys Ser Asn Phe Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val 
370 375 380 

He Lys Met Val Gin Ala Leu Arg His Gly Ala Met Pro Pro Thr Leu 
385 390 395 400 

His Val Ala Glu Pro Thr Pro Glu Val Asp Trp Ser Ala Gly Ala Val 
405 410 415 

Glu Leu Leu Thr Glu Pro Arg Glu Trp Pro Ala Gly Asp Arg Pro Arg 
420 425 430 

Arg Ala Gly Val Ser Ala Phe Gly He Ser Gly Thr Asn Ala His Leu 
435 440 445 

He Leu Glu Glu Ala Pro Pro Ala Asp Ala Val Ala Glu Glu Pro Glu 
450 455 460 



Phe Lys Gly Pro Val Pro Leu Val Val Ser Ala Gly Ser Pro Thr Ser 
465 470 475 480 



Leu Ala Ala Gin Ala Gly Arg Leu Ala Glu Val Leu Ala Ser Gly Gly 
485 490 495 



- 173- 



Val Ser Arg Ala Arg Leu Ala Ser Gly Leu Leu Ser Gly Arg Ala Leu 
500 505 510 

Leu Gly Asp Arg Ala Val Val Val Ala Gly Thr Asp Glu A^p Ala Val 
515 • 520 525 

Ala Gly Leu Arg Ala Leu Ala Ajrg Gly Asp A-rg Ala Pro Gly Val Leu 
530 535 540 

Thr Gly Ser Ala Lys His Gly Lys Val Val Tyr Val Phe Pro Gly Gin 
545 . 550 555 560 

Gly Ser Gin Arg Leu Gly Met Gly Arg Glu Leu Tyr Asp Arg Tyr Pro 
565 570 575 

Val Phe Ala Thr Ala Phe Asp Glu Ala Cys Glu Gin Leu Asp Val Cys 
580 585 590 

Leu Ala Gly Arg Ala Gly His Arg Val Arg Asp Val Val Leu Gly Glu 
595 600 605 

Val Pro Ala Glu Thr Gly Leu Leu Asn Gin Thr Val Phe Thr Gin Ala 
610 615 620 

Gly Leu Phe Ala Val Glu Ser Ala Leu Phe Arg Leu A-la Glu Ser Trp 
625 630 635 640 

Gly Val Arg Pro A^p Val Val Leu Gly His Ser lie Gly Glu He Thr 
645 650 655 

Ala Ala Tyr Ala Ala Gly Val Phe Ser Leu Pro Asp AJ.a Ala Arg He 
660 665 670 

Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu Ala Pro Gly Gly Ala 
675 680 685 



Met Val Ala Val Ala Ala Ser Glu Ala Glu Val Ala Glu Leu Leu Gly 
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690 695 700 

Asp Gly Val Glu Leu Ala Ala Val Asn Gly Pro Ser Ala Val Val Leu 
705 710 715 720 

Ser Gly Asp Ala Asp Ala Val Val Ala Ala Ala Ala Arg Met Arg Glu 
725 730 735 

Arg Gly His Lys Thr Lys Gin Leu Lys Val Ser His Ala Phe His Ser 
740 745 750 

Ala Arg Met Ala Pro Met Leu Ala Glu Phe Ala Ala Glu Leu Ala Gly 
755 760 765 

Val Thr Trp Arg Glu Pro Glu lie Pro Val Val Ser Asn Val Thr Gly 
770 775 780 

Arg Phe Ala Glu Pro Gly Glu Leu Thr Glu Pro Gly Tyr Trp Ala Glu 
785 790 795 800 

His Val Arg Arg Pro Val Arg Phe Ala Glu Gly Val Ala Ala Ala Thr 
805 810 815 

Glu Ser Gly Gly Ser Leu Phe Val Glu Leu Gly Pro Gly Ala Ala Leu 
820 825 830 

Thr Ala Leu Val Glu Glu Thr Ala Glu Val Th-r Cys Val Ala Ala Leu 
835 840 845 

Arg Asp Asp Arg Pro Glu Val Thr Ala Leu lie Thr Ala Val Ala Glu 
850 855 860 

Leu Phe Val Arg Gly Val AJLa Val Asp Trp Pro Ala Leu Leu Pro Pro 
865 870 875 880 



Val Thr Gly Phe Val Asp Leu Pro Lys Tyr Ala Phe Asp Gin Gin His 
885 890 895 
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Tyr Trp Leu Gin Pro Ala Ala Gin Ala Thr Asp Ala Ala Ser Leu Gly 
900 905 910 

Gin Val AJLa Ala Asp His Pro Leu Leu Gly Ala Val Val Arg Leu Pro 
915 920 925 

Gin Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Leu Lys Ser His 
930 935 940 

Pro Trp Leu Ala Asp His Val He Gly Gly Val Val Leu Val Ala Gly 
945 950 955 960 

Thr Gly Leu Val Glu Leu Ala Val Arg Ala Gly Asp Glu Ala Gly Cys 
965 970 975 

Pro Val Leu Glu Glu Leu Val He Glu Ala Pro Leu Val Val Pro Asp 
980 985 990 

His Gly Gly Val Arg He Gin Val Val Val Gly Ala Pro Gly Glu Thr 
995 1000 1005 

Gly Ser Arg Ala Val Glu Val Tyr Ser Leu Arg Glu Asp Ala Gly Ala 
1010 1015 1020 

Glu Val Trp A-la Arg His Ala Thr Gly Phe Leu Ala Ala Thr Pro Ser 
1025 1030 1035 1040 

Gin His Lys Pro Phe Asp Phe Thr Ala Trp Pro Pro Pro Gly Val Glu 
1045 1050 1055 

Arg Val Asp Val Glu Asp Phe Tyr Asp Gly Leu Vai Asp Arg Gly Tyr 
1060 1065 1070 



Ala Tyr Gly Pro Ser Phe Arg Gly Leu Arg Ala Val Trp Arg Arg Gly 
1075 1080 1085 
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Asp Glu Val Phe Ala Glu Val Ala Leu Ala Glu Asp Asp Arg Ala Asp 
1090 1095 1100 

Ala Ala Arg Phe Gly He His Pro Gly Leu Leu Asp Ala Ala Leu His 
1105 1110 1115 1120 

Ala Gly Met Ala Gly Ala Thr Thr Thr Glu Glu Pro Gly Arg Pro Val 
1125 1130 1135 

Leu Pro Phe Ala Trp Asn Gly Leu Val Leu His Ala Ala Gly Ala Ser 
1140 1145 1150 

Ala Leu Arg Val Arg Leu Ala Pro Ser Gly Pro Asp Ala Leu Ser Val 
1155 1160 1165 

Glu Ala Ala Asp Glu Ala Gly Gly Leu Val Val Thr Ala Asp Ser Leu 
1170 1175 1180 

Val Ser Arg Pro Val Ser Ala Glu Gin Leu Gly Ala Ala Ala Asn His 
1185 IISO 1195 1200 

Asp Ala Leu Phe Arg Val Glu Trp Thr Glu He Ser Ser Ala Gly Asp 
1205 1210 1215 

Val Pro Ala Asp His Val Glu Val Leu Glu Ala Val Gly Glu Asp Pro 
1220 1225 1230 

Leu Glu Leu Thr Gly Arg Val Leu Glu Ala Val Gin Thr Trp Leu Ala 
1235 1240 1245 

Asp Ala Ala Asp Asp Ala Arg Leu Val Val Val Thr Arg Gly Ala Val 
1250 1255 1260 

His Glu Val Thr Asp Pro Ala Gly Ala Ala Val Trp Gly Leu He Arg 
1265 1270 1275 1280 



Ala Ala Gin Ala Glu Asn Pro Asp Arg He Val Leu Leu Asp Thr Asp 



1285 1290 1295 

Gly Glu Val Pro Leu Gly Arg Val Leu Ala Thr Gly Glu Pro Gin Thr 
1300 1305 1310 

Ala Val Arg Gly Ala Thr Leu Phe Ala Pro Arg Leu Ala Arg Ala Glu 
1315 1320 1325 

Ala Ala Glu Ala Pro Ala Val Thr Gly Gly Thr Val Leu He Ser Gly 
1330 1335 1340 

Ala Gly Ser Leu Gly Ala Leu Thr Ala Arg His Leu Val Ala Arg His 
1345 1350 ■ 1355 1360 

Gly Val Arg Arg Leu Val Leu Val Ser Arg Arg Gly Pro Asp Ala Asp 
1365 1370 1375 

Gly Met Ala Glu Leu Thr Ala Glu Leu He Ala Gin Gly Ala Glu Val 
1380 1385 1390 

Ala Val Val Ala Cys Asp Leu Ala Asp Arg Asp Gin Val Arg Val Leu 
1395 1400 1405 

Leu Ala Glu His Ajrg Pro Asn Ala Val Val His Thr Ala Gly Val Leu 
1410 1415 1420 

Asp Asp Gly Val Phe Glu Ser Leu Thr Arg Glu Arg Leu Ala Lys Val 
1425 1430 1435 1440 

Phe Ala Pro Lys Val Thr Ala Ala Asn His Leu Asp Glu Leu Thr Arg 
1445 1450 1455 

Glu Leu Asp Leu Arg Ala Phe Val Val Phe Ser Ser Ala Ser Gly Val 
1460 1465 1470 



Phe Gly Ser Ala Gly Gin Gly Asn Tyr Ala Ala Ala Asn Ala Tyr Leu 
1475 1480 1485 
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Asp Ala Val Val Ala Asn Arg Arg Ala Ala Gly Leu Pro Gly Thr Ser 
1490 1495 1500 

Leu Ala Trp Gly Leu Trp Glu Gin Thr Asp Gly Met Thr Ala His Leu 
1505 1510 1515 1520 

Gly Asp Ala Asp Gin Ala Arg Ala Ser Arg Gly Gly Val Leu Ala He 
1525 1530 1535 

Ser Pro Ala Glu Gly Met Glu Leu Phe Asp Ala Ala Pro Asp Gly Leu 
1540 1545 1550 

Val Val Pro Val Lys Leu Asp Leu Arg Lys Thr Arg Ala Gly Gly Thr 
1555 1560 1565 

Val Pro His Leu Leu Arg Gly Leu Val Arg Pro Gly Arg Gin Gin Ala 
1570 1575 1580 

Arg Pro Ala Ser Thr Val Asp Asn Gly Leu Ala Gly Arg Leu Ala Gly 
1585 1590 1595 1600 

Leu Ala Pro Ala Glu Gin Glu Ala Leu Leu Leu Asp Val Val Arg Thr 
1605 1610 1615 

Gin Val Ala L-eu Val Leu Gly His Ala Gly Pro Glu Ala Val Arg Ala 
1620 1625 1630 

Asp Thr Ala Phe Lys Asp Thr Gly Phe Asp Ser Leu Thr Ser Val Glu 
1635 1640 1645 

Leu Arg Asn Arg Leu Arg Glu Ala Ser Gly Leu Lys Leu Pro Ala Thr 
1650 1655 1660 

Leu Val Phe Asp Tyr Pro Thr Pro Val Ala Leu Ala Arg Tyr Leu Arg 
1665 1670 1675 1680 
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Asp Glu Leu Gly Asp Thr Val Ala Thr Thr Pro Val Ala Thr Ala Ala 
1685 1690 1695 

Ala Ala Asp Ala Gly Glu Pro He Ala He Val Gly Met Ala Cys Arg 
1700 1705 1710 

Leu Pro Gly Gly Val Thr Asp Pro Glu Gly Leu Trp Arg Leu Val Arg 
1715 1720 1725 

Asp Gly Leu Glu Gly Leu Ser Pro Phe Pro Glu Asp Arg Gly Trp Asp 
1730 1735 1740 

Leu Glu Asn Leu Phe Asp Asp Asp Pro Asp Arg Ser Gly Thr Thr Tyr 
1745 1750 1755 1760 

Thr Ser Arg Gly Gly Phe Leu Asp Gly Ala Gly Leu Phe Asp Ala Gly 
1765 1770 1775 

Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin 
1780 1785 1790 

Arg Leu Leu Leu Glu Ala Ala Trp Glu Ala Leu Glu Gly Thr Gly Val 
1795 1800 1805 

Asp Pro Gly Ser Leu Lys Gly Ala Asp Val Gly Val Phe Ala Gly Val 
1810 1815 1820 

Ser Asn Gin Gly Tyr Gly Met Gly Ala Asp Pro Ala Glu Leu Ala Gly 
1825 1830 1835 * 1840 

Tyr Ala Ser Thr Ala Gly Ala Ser Ser Val Val Ser Gly Arg Val Ser 
1845 1850 1855 

Tyr Val Phe Gly Phe Glu Gly Pro Ala Val Thr He Asp Thr Ala Cys 
1860 1865 1870 

Ser Ser Ser Leu Val AJLa Met His Leu Ala Gly Gin Ala Leu Arg Gin 
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1875 1880 1885 

Gly Glu Cys Ser Met Ala Leu Ala Gly Gly Val Thr Val Met Gly Thr 
1890 1895 1900 

Pro Gly Thr Phe Val Glu Phe Ala Lys Gin Arg Gly Leu Ala Gly Asp 
1905 1910 1915 1920 

Gly Arg Cys Lys Ala Tyr Ala Glu Gly Ala Asp Gly Thr Gly Trp Ala 
1925 1930 1935 

Glu Gly Val Gly Val Val Val Leu Glu Arg Leu Ser Val Ala Arg Glu 
1940 1945 1950 

Arg Gly His Arg Val Leu Ala Val Leu Arg Gly Ser Ala Val Asn Ser 
1955 1960 1965 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin 
1970 1975 1980 

Arg Val lie Arg Arg Ala Leu Ala Gly Ala Gly Leu Glu Pro Ser Asp 
1985 1990 1995 2000 

Val Asp He Val Glu Gly His Gly Thr Gly Thr Ala Leu Gly Asp Pro 
2005 2010 2015 

He Glu Ala Gin Ala Leu Leu Ala Thr Tyr Gly Lys Asp Arg Asp Pro 
2020 2025 2030 

Glu Thr Pro Leu Trp Leu Gly Ser Val Lys Ser Asn Phe Gly His Thr 
2035 2040 2045 

Gin Ser Ala Ala Gly Val Ala Gly Val He Lys Met Val Gin Ala Leu 
2050 2055 2060 

Arg His Gly Val Met Pro Pro Thr Leu His Val Asp Arg Pro Thr Ser 
2065 2070 2075 2080 
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Gin Val Asp Trp Ser Ala Gly Ala Val Glu Val Leu Thr Glu Ala Arg 
2085 2090 2095 

Glu Trp Pro Arg Asn Gly Arg Pro Arg Arg Ala Gly Val Ser Ser Phe 
2100 2105 2110 

Gly He Ser Gly Thr Asn Ala His Leu He He Glu Glu Ala Pro Ala 
2115 2120 2125 

Glu Pro Gin Leu Ala Gly Pro Pro Pro Asp Gly Gly Val Val Pro Leu 
2130 2135 2140 

Val Val Ser Ala Arg Ser Pro Gly Ala Leu Ala Gly Gin Ala Arg Arg 
2145 2150 2155 2160 

Leu Ala Thr Phe Leu Gly Asp- Gly Pro Leu Ser Asp Val Ala Gly Ala 
2165 2170 2175 

Leu Thr Ser Arg Ala Leu Phe Gly Glu Arg Ala Val Val Val Ala Asp 
2180 2185 2190 

Ser Ala Glu Glu Ala Arg Ala Gly Leu Gly Ala Leu Ala Arg Gly Glu 
2195 2200 2205 

Asp Ala Pro Gly Leu Val Arg Gly Arg Val Pro Ala Ser Gly Leu Pro 
2210 2215 2220 

Gly Lys Leu Val Trp Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly 
2225 2230 2235 2240 

Met Gly Arg Glu Leu Leu Glu Glu Ser Pro Val Phe Ala Glu Arg He 
2245 2250 2255 

Ala Glu Cys Ala Ala Ala Leu Glu Pro Trp He Gly Trp Ser Leu Phe 
2260 2265 2270 
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Asp Val Leu Arg Gly Asp Gly Asp Leu Asp Arg Val Asp.Val Leu Gin 
2275 2280 2285 

Pro Ala Cys Phe Ala Val Met Val Gly Leu Ala Ala Val Trp Ser Ser 
2290 2295 2300 

Ala Gly Val Val Pro Asp Ala Val Leu Gly His Ser Gin Gly Glu He 
2305 2310 2315 2320 

Ala Ala Ala Cys Val Ser Gly Ala Leu Ser Leu Glu Asp Ala Ala Lys 
2325 2330 2335 

Val Val Ala Leu Arg Ser Gin Ala He Ala Ala Lys Leu Ser Gly Arg 
2340 2345 2350 

Gly Gly Met Ala Ser Val Ala Leu Gly Glu Ala Asp Val Val Ser Arg 
2355 2360 2365 

Leu Ala Asp Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Val 
2370 2375 2380 

Val He Ala Gly Asp Ala Gin Ala Leu Asp Glu Thr Leu Glu Ala Leu 
2385. 2390 2395 2400 

Ser Gly Ala Gly He Arg Ala Arg Arg Val Ala Val Asp Tyr Ala Ser 
2405 2410 2415 

His Thr Arg His Val Glu Asp He Glu Asp Thr Leu Ala Glu Ala Leu 
2420 2425 2430 

Ala Gly He Asp Ala Arg Ala Pro Leu Val Pro Phe Leu Ser Thr Leu 
2435 2440 2445 

Thr Gly Glu Trp He Arg Asp Glu Gly Val Val Asp Gly Gly Tyr Trp 
2450 2455 2460 



Tyr Arg Asn Leu Arg Gly Arg Val Arg Phe Gly Pro Ala Val Glu Ala 
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2465 2470 2475 2480 

Leu Leu Ala Gin Gly His Gly Val Phe Val Glu Leu Ser Ala His Pro 
2485 2490 2495 

Val Leu Val Gin Pro lie Thr Glu Leu Thr Asp Glu Thr Ala Ala Val 
2500 2505 2510 



Val Thr Gly Ser Leu Arg Arg Asp Asp Gly Gly Leu Arg Arg Leu Leu 
2515 2520 2525 

Thr Ser Met Ala Glu Leu Phe Val Arg Gly Val Glu Val Asp Trp Thr 
2530 2535 2540 

Ser Leu Val Pro Pro Ala Arg Ala Ajsp Leu Pro Thr Tyr Ala Phe Asp 
2545 2550 2555 2560 

His Glu His Tyr Trp Leu Arg Ala Ala Asp Thr Ala Ser Asp Ala Val 
2565 2570 2575 

Ser Leu Gly Leu Ala Gly Ala As? His Pro Leu Leu Gly Ala Val Val 
2580 2585 2590 

Gin Leu Pro Gin Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Leu 
2595 2600 2605 

Arg Ser His Pro Trp Leu Ala Asp His Ala Val Arg Asp Val Val lie 
2610 2615 2620 

Val Pro Gly Thr Gly Leu Val Glu Leu A-la Val Arg Ala Gly Asp Glu 
2625 2630 2635 2640 

Ala Gly Cys Pro Val Leu Asp Glu Leu Val lie Glu Ala Pro Leu Val 
2645 2650 2655 

Val Pro Arg Arg Gly Gly Val Arg Val Gin Val Ala Leu Gly Gly Pro 
2660 2665 2670 
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Ala Asp Asp Gly Ser Arg Thr Val Asp Val Phe Ser Leu Arg Glu Asp 
2675 2680 2685 

Ala Asp Ser Trp Leu Arg His Ala Thr Gly Val Leu Val Pro Glu Asn 
2690 2695 2700 

Arg Pro Arg Gly Thr Ala Ala Phe Asp Phe Ala Ala Trp Pro Pro Pro 
2705 2710 2715 2720 

Glu Ala Lys Pro Val Asp Leu Thr Gly Ala Tyr Asp Val Leu Ala Asp 
2725 2730 2735 

Val Gly Tyr Gly Tyr Gly Pro Thr Phe Arg Ala Val Arg Ala Val Trp 
2740 2745 2750 

Arg Arg Gly Ser Gly Asn Thr Thr Glu Thr Phe Ala Glu He Ala Leu 
2755 2760 2765 

Pro Glu Asp Ala Arg Ala Glu Ala Gly Arg Phe Gly He His Pro Ala 
2770 2775 2780 

Leu Leu Asp Ala Ala Leu His Ser Thr Met Val Ser Ala Ala Ala Asp 
2785 2790 2795 2800 

Thr Glu Ser Tyr Gly As? Glu Val Arg Leu Pro Phe Ala Trp Asn Gly 
2805 2810 2815 

I^u Arg Leu His Ala Ala Gly Ala Ser Val Leu Arg Val Arg Val Ala 
2820 2825 2830 

Lys Pro Glu Arg Asp Ser Leu Ser Leu Glu Ala Val Asp Glu Ser Gly 
2835 2840 2845 

Gly Leu Val Val Thr Leu Asp Ser Leu Val Gly Arg Pro Val Ser Asn 
2850 2855 2860 
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Asp Gin Leu Thr Thr Ala Ala Gly Pro Ala Gly Ala Gly Ser Leu Tyr 
2865 2870 2875 2880 

Arg Val Asp Trp Thr Pro Leu Ser Ser Val Asp Thr Ser Gly Arg Val 
2885 2890 2895 

Pro Ser Trp Leu Pro Val Ala Thr Ala Glu Glu Val Ala Thr Leu Ala 
2900 2905 2910 

Asp Asp Val Leu Thr Gly Ala Thr Glu Ala Pro Ala Val Ala Val Met 
2915 2920 2925 

I"* Glu Ala Val Ala Asp Glu Gly Ser Val Leu Ala Leu Thr Val Arg Val 

S 2930 2935 2940 



Leu Asp Val Val Gin Cys Trp Leu Ala Gly Gly Gly Leu Glu Gly Thr 
2945 2950 2955 2960 

Lys Leu Ala He Val Thr Arg Gly Ala Val Pro Ala Gly Asp Gly Val 
2965 2970 2975 

Val His Asp Pro Ala Ala Ala Ala Val Trp Gly Leu Val Arg Ala Ala 
2980 2985 2990 

Gin Ala Glu Asn Pro Asp Arg He Val Leu Leu Asp Val Glu Pro Glu 
2995 3000 3005 

Ala As? Val Pro Pro Leu Leu Gly Ser Val Leu Ala Asp Gly Glu Pro 
3010 3015 3020 

Gin Val Ala Val Arg Gly Thr Thr Leu Ser He Pro Arg Leu Ala Axg 
3025 3030 3035 3040 

Ala Ala Arg Pro Asp Pro Ala Ala Gly Phe Lys Thr Arg Gly Pro Val 
3045 3050 3055 



Leu Val Thr Gly Gly Thr Gly Ser Leu Gly Gly Leu Val Ala Arg His 
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3060 3065 3070 



Leu Val Glu Arg His Gly Val Arg Gin Leu Val Leu Ala Ser Arg Arg 
3075 3080 3085 

Gly Leu Asp Ala Glu Gly Ala Lys Asp Leu Val Thr Asp Leu Thr Ala 
30S0 3095 3100 

Leu Gly Ala A-sp Val Ala Val Ala Ala Cys Asp Val Ala Asp Arg Asp 
3105 3110 3115 3120 

Gin Val Ala Ala Leu Leu Thr Glu His Arg Pro Ser Ala Val Val His 
3125 3130 3135 

Thr AJ-a Gly Val Pro Asp Ala Gly Val He Gly Thr Val Thr Pro A^p 
3140 3145 3150 

Arg Leu AJ.a Glu Val Phe Ala Pro Lys Val Thr Ala Ala Arg His Leu 
3155 3160 3165 

Asp Glu Leu Thr Arg Asp Leu Asp Leu Asp Ser Phe Val Val Tyr Ser 
3170 3175 3180 

Ser Val Ser Ala Val Phe Met Gly Ala Gly Ser Gly Ser Tyr Ala Ala 
3185 3190 3195 3200 

Ala Asn AJ.a Tyr Leu Asp Gly Leu Met Ala His Axg Arg Ala Ala Gly 
3205 3210 3215 

Leu Pro Gly Gin Ser Leu Ala Trp Gly Leu Trp Asp Gin Thr Thr Gly 
3220 3225 3230 

Gly Met Ala AJLa Gly Thr A^p Glu Ala Gly Arg Ala Arg Met Thr Arg 
3235 3240 3245 



Arg Gly Gly Leu Val Ala Met Lys Pro Ala Ala Gly Leu Asp Leu Phe 
3250 3255 3260 
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Asp Ala Ala lie Gly Ser Gly Glu Pro Leu Leu Val Pro Ala Gin Leu 
3265 3270 3275 3280 

Asp Leu Ajcg Gly Leu Arg Ala Glu Ala Ala Gly Gly Thr Glu Val Pro 
3285 3290 3295 

His Leu Leu Arg Gly Leu Val Arg Ala Gly Arg Gin Gin Ala Arg Ala 
3300 3305 3310 

Ala Ser Thr Val Glu Glu Asn Trp Ala Gly Arg Leu Ala Gly. Leu Glu 
3315 3320 3325 

Pro Ala Glu Arg Gly Gin Val Leu Leu Glu Leu Val Arg Ala Gin Val 
3330 3335 3340 

Ala Gly Val Leu Gly Tyr Arg Ala Ala His Gin Val Asp Pro Asp Gin 
3345 3350 3355 3360 

Gly Leu Phe Glu He Gly Phe Asp Ser Leu Thr Ala He Glu Leu Arg 
3365 3370 3375 

Asn Arg Leu Arg Ala Arg Thr Glu Arg Lys He Ser Pro Gly Val Val 
3380 3385 3390 

Phe Asp His Pro Thr Pro Ala Leu Leu Ala AJLa His Leu Asn Glu Leu 
3395 3400 3405 

Leu Arg Lys Lys Val 
3410 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Ala lie Pro Tyr Ser Ser Leu Ala Tyr Glu Leu Arg Asp Ala Val 
1 5 10 15 

Asn Val Val Asp Leu Asp Glu Asp Asp Val Phe Val Thr Ser lie Ala 
20 25 30 

Glu Gly Gin Gly Gly Ala Cys Tyr His Leu Asn Arg Leu Phe His Arg 
35 40 45 

Leu Leu Thr Glu Leu Gly Tyr Asp Val Thr Pro Leu Ma Gly Ser Thr 
50 55 60 

Ala Glu Gly Arg Glu Thr Phe Gly Thr Asp Val Glu His Met Phe Asn 
65 70 75 80 

Leu Val Thr Leu Asp Gly Ala Asp Trp Leu Val Asp Val Gly Tyr Pro 
85 90 .95 

Gly Pro Thr Tyr Val Glu Pro Leu Ala Val Ser Pro Ala Val Gin Thr 
100 105 110 

Gin Tyr Gly Ser Gin Phe Arg Leu Val Glu Gin Glu Thr Gly Tyr Ala 
115 120 125 

Leu Gin Arg Arg Gly Ala Val Thr Arg Trp Ser Val Val Tyr Thr Phe 
130 135 140 



Thr Thr Gin Pro Ajrg Gin Trp Ser Asp Trp Lys Glu Leu Glu Asp Asn 



145 150 155 160 

Phe Arg Ala Leu Val Gly Asp Thr Thr Arg Thr Asp Thr Gin Glu Thr 
165 170 175 

Leu Cys Gly Arg Ala Phe Ala Asn Gly Gin Val Phe Leu Arg Gin Arg 
180 185 190 

Arg Tyr Leu Thr Val Glu Asn Gly Arg Glu Gin Val Arg Thr He Thr 
195 200 205 

Asp Asp Asp Glu Phe Arg Ala Leu Val Ser Arg Val Leu Ser Gly Asp 
210 215 220 



His Gly 
225 



